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(57) Abstract 



Genetic polymorphisms are identified in the human UGT1 gene that alter UGT1 -dependent drug metabolism. Nucleic acids comprising 
the polymorphic sequences are used to screen patients for altered metabolism for UGT1 substrates, potential drug-drug interactions, and 
adverse/side effects, as well as diseases that result from environmental or occupational exposure to toxins. The nucleic acids are used to 
establish animal, cell and in vitro models for drug metabolism. 
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GENOTYPING THE 
HUMAN UDP-GLUCURONOSYLTRANSFERASE 1 (UGT1) GENE 

Introduction 

The metabolic processes commonly involved in the biotransformation of xenobiotics 
have been classified into functionalization reactions (phase I reactions), in which lipophilic 
compounds are modified via monooxygenation, dealkylation, reduction, aromatization, or 
hydrolysis. These modified molecules can then be substrates for the phase II reactions, 
often called conjugation reactions, as they conjugate a functional group with a polar, 
endogenous compound. Drug glucuronidation, a major phase II conjugation reaction in the 
mammalian detoxification system, is catalyzed by the UDP-glucuronosyltransferases 
(UGTs) (Batt AM, et aL (1994) Clin Chim Acta 226:171-190; Burchell et al. (1995) Life Sci. 
57:1819-31). 

The UGTs are a family of enzymes that catalyze the glucuronic acid conjugation of 
a wide range of endogenous and exogenous substrates including phenols, alcohols, 
amines and fatty acids. The reactions catalyzed by UGTs permit the conversion of a large 
range of toxic endogenous/xenobiotic compounds to more water-soluble forms for 
subsequent excretion (Parkinson A (1996) Toxicol Pathol 24:48-57). 

The UGT isoenzymes are located primarily in hepatic endoplasmic reticulum and 
nuclear envelope (Parkinson A (1996) Toxicol Pathol 24:48-57), though they are also 
expressed in other tissues such as kidney and skin. UGTs are encoded by a large 
multigene superfamiiy that has evolved to produce catalysts with differing but overlapping 
substrate specificities. Three families, UGT1, UGT2, and UGT8, have been identified 
within the superfamiiy. UGTs are assigned to one of the subfamilies based on amino acid 
sequence identity, e.g., UGT1 family members have greater than 45% amino acid 
sequence identity (Mackenzie et al. 1997) Pharmacogenetics 7:255-69). 

The UGT1 locus is located on chromosome 2q37, and contains at least 12 
promoters/first exons, which are apparently able to splice with common exons 2 through 5, 
producing gene products having strikingly different N-terminal halves (amino acid sequence 
identities ranging from 24% to 49%), but identical C-terminal halves (Fig. 1). At least eight 
different isoenzymes are encoded by the UGT1 locus; at least one or more first exons 
encode pseudogenes. The different N-terminal halves encoded by the first exons confer 
different substrate binding specificities upon the UGT1 isoenzymes, while exons 2-5, which 
are present in all UGT1 isoenzyme mRNAs, encode the UDP-glucuronic acid binding 
domain, membrane anchorage site, and ER retention signal that are common to all UGT 
proteins (Ritter et al. (1992) J Biol Chem 267:3257-3261). UGT1 locus isoenzymes are 
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best known for their role in glucoronidation and metabolism of many substrates, including 
bilirubin (1A1, 1D1), planar and non-planar phenols, naphthols (1F1) (Ouzzine M, et al. 
C" 4 ) Arch Biochem Biophys 310:196-204), anthraquinones, flavones, aliphatic alcohols, 
aromatic carboxylic acids, and steroids (Ebner T, et al. (1993) Drug Metab Dispos 21:50- 
55). 

In addition to UGT1 exon usage, metabolism of endogenous and exogenous 
substrates can also be affected by competitive binding phenomena. For example, in some 
cases exogenous substrates for the UGT1 enzymes have a higher binding affinity or avidity 
for the enzyme than the endogenous UGT1 substrates. For example, UGT1*1, the major 
bilirubin-metabolizing form of UGT1, more readily binds both octyl-gallate and emodin than 
it binds bilirubin, thus indicating the potential of these xenobiotics to cause jaundice by 
inhibition of bilirubin binding to UGT1*1 (where 1*1 indicates that the first exon is used in 
the spliced gene product). UGT1M is also responsible for glucuronidation of the oral 
contraceptive ethinylestradiol (Ebner et al. (1993) Mol. Pharmacol 43:649-54), and can 
also glucuronidate phenols, anthroquinones, flavones, and certain endogenous steroids. 

As noted above, the first exon present in UGT1 can affect substrate binding 
specificity of the UGT1 gene product (for a review, see Burchell (1995) Life Sci. 57:1819- 
31). For example, UGT1*2 accepts a wide range of compounds as substrates including 
non-planar phenols, anthraquinones, flavones, aliphatic alcohols, aromatic carboxylic acids, 
steroids (4-hydroxyestrone, estrone) and many drugs of varied structure (Ebner et al. 
(1993) Drug. Metab. Disp. 21:50-5; Burchell (1995) Life Sci. 57:1819-31). In contrast, 
UGT1*6 exhibits only limited substrate specificity for planar phenolic compounds relative to 
other human UGTs. 

Polymorphisms can markedly affect binding of the endogenous substrate, which can 
be manifested as clinical syndromes. At least two conditions, Crigler-Najjar syndrome and 
Gilbert syndrome, are associated with UGT1 polymorphisms. Both of these syndromes are 
hereditary and are associated with predominantly unconjugated hyperbilirubinemia. Crigler- 
Najjar syndrome is associated with intense, persistent jaundice which begins at birth. 
Some affected infants die in the first weeks or months of life with kemicterus; others survive 
with little or no neurologic defect. Crigler-Najjar syndrome is caused by a defect in the 
ability of UGT1 to catalyze UDP-glucuronidation of bilirubin, resulting in accumulation of 
bilirubin in the blood (Erps et al. (1994) J. Clin. Invest 93:564-70). Gilbert syndrome is a 
benign mild form of unconjugated hyperbilirubinemia that is characterized by normal liver 
function tests, normal liver histology, delayed clearance of bilirubin from the blood, and mild 
jaundice that tends to fluctuate in severity. As with Crigler-Najjar syndrome, Gilbert 
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syndrome is associated with a defect in UGT1. Specific UGT polymorphisms that are 
known to be associated with disease are indicated in Fig. 1 . 

Alteration of the expression or function of UGTs may also affect drug metabolism. 
For example, there may be common polymorphisms in the human UGT1 gene that alter 
expression or function of the protein product and cause drug exposure-related phenotypes. 
Thus, there is a need in the field to identify UGT1 polymorphisms in order to provide a 
better understanding of drug metabolism and the diagnosis of drug exposure-related 
phenotypes. 

Relevant Literature 

Genbank accession number M84122 provides UGT1 exon 2, M84123 provides 
exons 3 and 4, M84124 provides 5, M84125 provides exon 1A, M84127 provides exon 1C, 
M84128 provides exon 1D, M84129 provides exon 1E, M84130 provides exon 1F, U39570 
provides exon 1G, U42604 provides exon 1H, U39550 provides exon 1 J. 

The UGT gene superfamily and recommended nomenclature for describing UGT 
genes and alleles are reviewed in Mackenzie et al. (1997) Pharmacooenet 7:255-69. 

The two UGT1A6 genetic polymorphisms are described in Ciotti et al. (1997) Am. J. 
Hum. Genet. 61(Supp):A249. The identification of Asp446 as a critical residue in UGT1 is 
described in Iwano et al. (1997) Biochem. J 325:587-91. 

A review of the substrate specificity of human UDP-glucuronosyltransferases is 
provided by Burchell et al. (1995) Life Sci. 57:1819-31 . For a review of drug 
glucoronidation in humans, see Miners et al. (1991) Pharmacol Thpr 51:347-69. 

At least twelve UGT1A1 polymorphisms have been identified and linked to disease. 
These UGT1A1 alleles, each of described in OMIM Entry 191740 (at 
http://www.ncbi.nlm.nih.gov/htbin-post/Omim/dispmim7191740) and in OMIM Entry 143500 
(at http://www.ncbi.nlm.nih.gov/htbin-post/Omim/dispmim7143500), include: 

1) UGT1*FB (UGT1A1, 13-BP DEL, EX2; 191740.0001), which contains a 13 bp 
deletion in exon 2 and is associated with Crigler-Najjar syndrome type I (CN-I); 

2) UGT1A1, EXON4, C-T, SER-PHE (191740.0002), which contains a C-to-T 
transition in exon 4 (resulting in an amino acid change from serine to phenylalanine) is 
associated with CN-I and deficiency of both bilirubin-UGT and phenol-UGT activities in the 
liver; 

3) UGT1A1, GLN331TER (191740.0003), which contains a C-to-T transition 6 bp 
upstream from the 3-prime end of exon 2 of the common region (replacement of a 
glutamine codon with a stop codon), is associated with CN-I; 
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4) UGT1A1 , ARG341TER (191740.0004), which contains a nonsense mutation 
(CGA-to-TGA) in exon 3 and is associated with CN-I and a total absence of all 
phenol/bilirubin UGT proteins and their activities in liver homogenate by enzymologic and 
immunochemical analysis; 

5) UGT1A1 , GLN331ART (191740.0005), which contains an A-to-G transition 5 bp 
upstream of the exon 2/intron 2 boundary (resulting in a glutamine-to-arginine substitution), 
is associated with Crigler-Najjar Syndrome, type II (CN-II); 

6) UGT1A1 , PHE170DEL (191740.0006), which contains a deletion of the 
phenylalanine codon at position 170 in exon 1, and is associated with CN-I; 

7) UGT1A1, SER376PHE (191740.0007), which contains a C-to-T transition in 
codon 376 (resulting in a change of serine to phenylalanine) and is associated with CN-I; 

8) UGT1A1 , GLY309GLU (191740.0008), which contains a G-to-A transition in 
codon 309 (resulting in a glycine to glutamic acid change) and is associated with CN-I; 

9) UGT1A1, NT840, C-A, CYS-TER (191740.0009), which contains a C-to-A 
transversion at base position 840 in exon 1 (resulting in replacing a cysteine with a stop 
codon), is associated with CN-I; 

10) UGT1A1, PR0229GLN (191740.00010), which contains C-to-A transversion at 
nucleotide 686 (changing proline-229 to glutamine), is associated with Gilbert syndrome; 

11) UGT1A1, 2-BP INS, TA INS, TATAA ELEMENT (191740.00011) contains 2 
extra bases (TA) in the TATAA element of the 5-prime promoter region of the gene (where 
normally an A(TA)6TAA element is present between nucleotides -23 and -3) and is 
associated with Gilbert syndrome; and 

12) UGT1A1, 1-BP INS, 470T INS (191740.00012), which contains 470insT 
mutation in exon 1 and is associated with CN-I. 

Summary of the Invention 

Genetic sequence polymorphisms are identified in the UGT1 gene. Nucleic acids 
comprising the polymorphic sequences are used in screening assays, and for genotyping 
individuals. The genotyping information is used to predict an individuals' rate of metabolism 
for UGT1 substrates, potential drug-drug interactions, and adverse/side effects. 

Accordingly, in one aspect the invention features an isolated nucleic acid molecule 
comprising a UGT1 sequence polymorphism of SEQ ID NOS:87-124, as part of other than 
a naturally occurring chromosome. In related aspects, the invention features nucleic acid 
probes for detection of UGT1 locus polymorphisms, where the probe comprises a 
polymorphic sequence of SEQ ID NOS:87-124. 
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fn another aspect the invention features an array of oligonucleotides comprising two 
or more probes for detection of UGT1 locus polymorphisms, where the probes comprise at 
least one form of a polymorphic sequences of SEQ ID NOS:87-124. 

In still another aspect, the invention features a method for detecting in an individual 
5 a polymorphism in UGT1 metabolism of a substrate, where the method comprises 

analyzing the genome of the individual for the presence of at least one UGT1 polymorphism 
of SEQ ID NOS:87-124; wherein the presence of the predisposing polymorphism is 
indicative of an alteration in UGT1 expression or activity. 

In one embodiment, the analyzing step of the method is accomplished by detection 
10 of specific binding between the individual's genomic DNA with an array of oligonucleotides 
comprising two or more probes for detection of UGT1 locus polymorphisms, where the 
probes comprise at least one form of a polymorphic sequence of SEQ ID NOS:87-124. 

In other embodiments of the method, the alteration is UGT1 expression or activity is 
tissue specific, or is in response to a UGT1 modifier. The UGT1 modifier may either induce 
15 or inhibit UGT1 expression. 

Brief Description of the Drawings 
Fig. 1 is a schematic showing the UGT1 locus. Each of the first exons is denoted by 
both its alphabetic and numerical nomenclatures (e.g., 1A and 1.1). 
20 Fig. 2 is a schematic showing exons 1A-1J of the UGT1 locus and the 

polymorphisms described in the present application. 

Fig. 3 is a schematic showing the exons 1A-1F, and 2-5 of the UGT1 locus and the 
polymorphisms that have been publicly disclosed. 

25 Brief Description of the Sequence Listing 

UGT1 Reference Sequences. SEQ ID NOS: 1, 3, 5, 7, 9, 11, 13, and 15 are the 
UGT1 reference polynucleotide sequences for UGT1 exons 1A, 1C, 1D, 1E, 1F, 1G, 1H, 
and 1J. The polypeptide sequences are encoded by these reference exon sequences are 
SEQ ID NOS:2, 4, 6, 8, 12, 14, and 16. SEQ ID NOS: 17 and 18 are the reference 
30 polynucleotide and amino acid sequences for UGT1 exons 2-5. 

PCR Primers. The primary and secondary PCR primers for amplification of 
polymorphic sequences are presented as SEQ ID NOS:19-50. 

Sequencing Primers. The primers used in sequencing isolated polymorphic 
sequences are presented as SEQ ID NOS:51-86. 
35 Polymorphisms. Polymorphic sequences of the invention are presented as SEQ ID 

NOS:88-124. 
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Description of the Specific Embodiments 
Pharmacogenetics is the linkage between an individual's genotype and that 
individual's ability to metabolize or react to a therapeutic agent. Differences in metabolism 
or target sensitivity can lead to severe toxicity or therapeutic failure by altering the relation 
5 between bioactive dose and blood concentration of the drug. Relationships between 

polymorphisms in metabolic enzymes or drug targets and both response and toxicity can be 
used to optimize therapeutic dose administration. 

Genetic polymorphisms are identified in the UGT1 gene. Nucleic acids comprising 
the polymorphic sequences are used to screen patients for altered metabolism for UGT1 
10 substrates, potential drug-drug interactions, and adverse/side effects, as well as diseases 
that result from environmental or occupational exposure to toxins. The nucleic acids are 
used to establish animal, cell culture and in vitro cell-free models for drug metabolism. 



Definitions 

15 It is to be understood that this invention is not limited to the particular methodology, 

protocols, cell lines, animal species or genera, constructs, and reagents described, as such 
may vary. It is also to be understood that the terminology used herein is for the purpose of 
describing particular embodiments only, and is not intended to limit the scope of the present 
invention which will be limited only by the appended claims. 

20 As used herein the singular forms "a", "and", and "the" include plural referents 

unless the context clearly dictates otherwise. Thus, for example, reference to "a construct" 
includes a plurality of such constructs and reference to "the UGT1 nucleic acid" includes 
reference to one or more nucleic acids and equivalents thereof known to those skilled in the 
art, and' so forth. All technical and scientific terms used herein have the same meaning as 

25 commonly understood to one of ordinary skill in the art to which this invention belongs 
unless clearly indicated otherwise. 

UGT1 polymorphic sequences. The sequence of the UGT1 gene is known in the 
art, and accessible in public databases, as cited above. This sequence is useful as a 
reference for the genomic location of the human gene, and for specific coding region 

30 sequences. As used herein, the term "UGT1 gene" is intended to refer to both the wild-type 
and variant sequences, unless specifically denoted otherwise. Nucleic acids of particular 
interest comprise the provided variant nucleotide sequence(s). For screening purposes, 
hybridization probes may be used where both polymorphic forms are present, either in 
separate reactions, or labeled such that they can be distinguished from each other. Assays 

35 may utilize nucleic acids that hybridize to one or more of the described polymorphisms. 
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The genomic UGT1 sequence is of particular interest. A polymorphic UGT1 gene 
sequence, i.e. including one or more of the provided polymorphisms, is useful for 
expression studies to determine the effect of the polymorphisms on enzymatic activity. The 
polymorphisms are also used as single nucleotide polymorphisms to detect genetic 
5 association with phenotypic variation in UGT1 activity and expression. 

The UGT1 exon structure is illustrated in Fig. 1. The UGT1 locus contains at least 
12 promoters/first exons, which are apparently able to splice with common exons 2 through 
5 t producing gene products having different N-terminal halves but identical C-terminal 
halves. The first exon utilized at least in part determines the substrate specificity of the 

10 resulting UGT1 gene product. Each of the first exons in Fig. 1 is denoted by both its 

alphabetic and numerical nomenclatures (e.g., 1A and 1.1). Polymorphisms in the UGT1 
first exon can be associated with alteration of substrate binding specificity and/or disease. 
Fig. 2 shows UGT1 exons 1A-1J and the polymorphisms described in the present 
application. Fig. 3 shows UGT1 exons 1A-1F and 2-5 and the polymorphisms in these 

15 exons that have been publicly disclosed. Polymorphisms denoted by an asterisk (*) have 
been assigned the indicated "allele name" (e.g., *12). The specific associated disease is 
indicated below in parentheses for several of these disease-associated polymorphisms. 
Except for the "mutation" that is associated with Gilbert's (*28, which is not universally 
agreed upon in the literature), all mutations in exons 1D, 1A, and 2-5 were isolated from 

20 individuals with disease. 

Fragments of the DNA sequence are obtained by chemically synthesizing 
oligonucleotides in accordance with conventional methods, by restriction enzyme digestion, 
by PCR amplification, efc. For the most part, DNA fragments will be of at least 15 nt, 
usually at least 20 nt, often at least 50 nt. Such small DNA fragments are useful as primers 

25 for PCR, hybridization screening, etc. Larger DNA fragments, i.e. greater than 100 nt are 
useful for production of the encoded polypeptide, promoter motifs, etc. For use in 
amplification reactions, such as PCR, a pair of primers will be used. The exact composition 
of primer sequences is not critical to the invention, but for most applications the primers will 
hybridize to the subject sequence under stringent conditions, as known in the art. 

30 The UGT1 nucleic acid sequences are isolated and obtained in substantial purity, 

generally as other than an intact mammalian chromosome. Usually, the DNA will be 
obtained substantially free of other nucleic acid sequences that do not include a UGT1 
sequence or fragment thereof, generally being at least about 50%, usually at least about 
90% pure and are typically "recombinant", i.e. flanked by one or more nucleotides with 

35 which it is not normally associated on a naturally occurring chromosome. 
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UGT1 polypeptides. The UGT1 genetic sequence, including polymorphisms, may 
be employed for synthesis of a complete UGT1 protein, or polypeptide fragments thereof, 
particularly fragments corresponding to functional domains; binding sites; etc/, and 
including fusions of the subject polypeptides to other proteins or parts thereof. For 

5 expression, an expression cassette may be employed, providing for a transcriptional and 
translational initiation region, which may be inducible or constitutive, where the coding 
region is operably linked under the transcriptional control of the transcriptional initiation 
region, and a transcriptional and translational termination region. Various transcriptional 
initiation regions may be employed that are functional in the expression host. The 

10 polypeptides may be expressed in prokaryotes or eukaryotes in accordance with 

conventional ways, depending upon the purpose for expression. Small peptides can also 
be synthesized in the laboratory. 

Substrate. A substrate is a chemical entity that is modified by UGT1, usually under 
normal physiological conditions. Although the duration of drug action tends to be shortened 

15 by metabolic transformation, drug metabolism is not "detoxification". Frequently the 
metabolic product has greater biologic activity than the drug itself. In some cases the 
desirable pharmacologic actions are entirely attributable to metabolites, the administered 
drugs themselves being inert. Likewise, the toxic side effects of some drugs may be due in 
whole or in part to metabolic products. 

20 Substrates can be either endogenous substrates (e.g., substrates normally found 

within the natural environment of UGT1, such as the bilirubin or other endobiotic 
compound) or exogenous (e.g., substrates that are not normally found within the natural 
environment of UGT1 , such as ethinyl estradiol or other xenobiotic compound). Exemplary 
UGT1 substrates (/.a, substrates of wild-type UGT1 and/or UGT1 polypeptides encoded by 

25 UGT1 polymorphisms) include, but are not necessarily limited to endobiotics such as 
bilirubin, bilirubin monoglucoronide, bile acids, steroids, thyroxine, biogenic amines, fat- 
soluble vitamins, UDPGA, 17(3 estradiol, estriol, 2-hydroxy-estriol, T4,rT3, and the like; and 
xenobiotics such as hydroxylated polycyclic aromatic hydrocarbons, heterocyclics, 
carcinogens, plant metabolites, octyl gallate, ethinylestradiol, anthraflavic acid, quercetin, 1- 

30 naphthol, naphthylamines, 4-aminobiphenyl, benzidine, imipamine, BP-3,6-quinol, 5- 

hydroxy-BP, acetaminophen, vanillin, naproxen, 4-methylumbelliferone, monohalogenated 
phenols, propofol, 4t-pentyIphenol, 4-hydroxybiphenyl, carvacrol, emodin, galangin, bulky 
phenols, carboxylic acids, 5-hydroxy 2AAF, 8-hydroxy 2AAF, and the like. Table 1 provides 
a summary of the major endobiotic and xenobiotic substrates, as well as exemplary non- 
35 substrates, of four UGT1 isoenzymes (UGT1*1 (same as UGT1A), UGT1*4 (same as 
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UGT1D), UGT1*6 (same as UGT1F), and UGT1*02 (same as UGT1G) (see Burchell et al. 
91995) LifeScL 57:1819-31). 



Table 1 Substrate Specificity of Human Liver UGT1 Isoenzymes 



| Isoenzyme 


Endobiotic 


Xenobiotic 


Non-substrate 


UGT1*I 


Bilirubin (Km 24 um) 
Bilirubin monoglucuronide 
UDPGA (Km 0.41 mM) 
17p estradiol 
Estriol 

2-hydrozy-estriol 
T4,rT3 


Octyl gallate (Km 162 um) 
Ethinylestradiol 

Anthraflavip arirl 

Quercetin 

1-naphthol 


Gallic acid 
T3 

JVivlllilUl 

Retinoic acid 

Clofibrate 

Morphine 

Propofol 

Testosterone 


UGT1*4 


Bilirubin? 


Naphthylamines 
4-aminobiphenyl 
Benzide 
Imip amine 


Bilirubin? 
Carbamazepine 


UGT1*6 




1-Naphthol 

BP-3,6-quinoJ 

5-hydroxy-BP 

Acetaminophen (Km 2 mM) 

Vanillin 

Naproxen 

4-methylumbelliferone 
Monohalogenated phenols 


4-Hydroxybiphenyl 

Propofol 

Galangin 

Emodin 

Morphine 

Estriol 

Estradiol 

AZT 

Menthol 


UGT1*7 


UDPGA (Km 0.41 mM) 
T4,rT3 


Propofol (Km 172 um) 
4t-pentylphenol 

4- hydroxybiphenyI 
Carvacrol 
Emodin 
Galangin 

Octyl gallate (Km 158 uM) 
Other bulky phenols 
Acetaminophen (Km 50 mM) 
Carboxylic acids (some) 

5- hydroxy 2AAF 
8-hydroxy 2AAF 


Morphine 

Estriol 

Estradiol 

AZT 

Menthol 

Chloramphenicol 

Androsterone 

T3 



Modifier A modifier is a chemical agent that modulates the action of UGT1, either 
through altering its enzymatic activity (enzymatic modifier) or through modulation of 
expression (expression modifier, e.g., by affecting transcription or translation). In some 
cases the modifier may also be a substrate. For example, the UGT1 gene contains an 
15 electrophile responsive element (USPN 5,589,504); thus, compounds such as metabolites 
of planar aromatic compounds and phenolic antioxidants, as well as reactive oxygen 
species including peroxides would be expression modifiers via their effect on the 
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electrophile responsive element. Endogenous and exogenous inducers that are capable of 
inducing particular UGT activities include phenobarbital, dioxin, peroxisome proliferators, 
rifamicin, oral contraceptive drug, carbamazepine, cigarette smoke, cabbage, brussel 
sprouts, polycyclic/aromatic hydrocarbons, and derivatives of indole 3-carbonil (see 

5 Burchell et al. (1995), supra, Parkinson In: "Biotransformation of Xenobiotics." Chapter 6, 
Casarett & Doull's Toxicology . 5 th Ed., C. Klaassen, ed.)). 

Pharmacokinetic parameters. Pharmacokinetic parameters provide fundamental 
data for designing safe and effective dosage regimens. A drug's volume of distribution, 
clearance, and the derived parameter, half-life, are particularly important, as they determine 

10 the degree of fluctuation between a maximum and minimum plasma concentration during a 
dosage interval, the magnitude of steady state concentration and the time to reach steady 
state plasma concentration upon chronic dosing. Parameters derived from in vivo drug 
administration are useful in determining the clinical effect of a particular UGT1 genotype. 
Expression assay. An assay to determine the effect of a sequence polymorphism 

15 on UGT1 expression. Expression assays may be performed in cell-free extracts, or by 
transforming cells with a suitable vector. Alterations in expression may occur in the basal 
level that is expressed in one or more cell types, or in the effect that an expression modifier 
has on the ability of the gene to be inhibited or induced. Expression levels of a variant 
alleles are compared by various methods known in the art. Methods for determining 

20 promoter or enhancer strength include quantitation of the expressed natural protein; 
insertion of the variant control element into a vector with a reporter gene such as 
p-galactosidase, luciferase, chloramphenicol acetyltransferase, etc, that provides for 
convenient quantitation; and the like. 

Gel shift or electrophoretic mobility shift assay provides a simple and rapid method 

25 for detecting DNA-binding proteins (Ausubel, F.M. ef a/. (1989) In: Current Protocols in 
Molecular Biology, Vol. 2, John Wiley and Sons, New York). This method has been used 
widely in the study of sequence-specific DNA-binding proteins, such as transcription 
factors. The assay is based on the observation that complexes of protein and DNA migrate 
through a nondenaturing polyacrylamide gel more slowly than free DNA fragments or 

30 double-stranded oligonucleotides. The gel shift assay is performed by incubating a purified 
protein, or a complex mixture of proteins (such as nuclear or cell extract preparations), with 
an end-labeled DNA fragment containing the putative protein binding site. The reaction 
products are then analyzed on a nondenaturing polyacrylamide gel. The specificity of the 
DNA-binding protein for the putative binding site is established by competition experiments 

35 using DNA fragments or oligonucleotides containing a binding site for the protein of 
interest, or other unrelated DNA sequences. 

-10- 



WO 99/57322 PCT7US99/09702 

Expression assays can be used to detect differences in expression of 
polymorphisms with respect to tissue specificity, expression level, or expression in 
response to exposure to various substrates, and/or timing of expression during 
development. For example, since UGT1A and UGT1E are expressed in liver, UGT1A and 
UGT1E polymorphisms could be evaluated for expression in tissues other than liver, or 
expression in liver tissue relative to a reference UGT1A or UGT1E polypeptide. Similarly, 
expression of polymorphisms in UGT1F, which is normally expressed in liver, kidney and 
skin, could be assayed in each of these tissues and the relative levels of expression 
compared to a reference UGT1F polypeptide. 

Substrate screening assay. Substrate screening assays are used to determine the 
metabolic activity of a UGT1 protein or peptide fragment on a substrate. Many suitable 
assays are known in the art, including the use of primary or cultured cells, genetically 
modified cells (e.g., where DNA encoding the UGT1 polymorphism to be studied is 
introduced into the cell within an artificial construct), cell-free systems, e.g. microsomal 
preparations or recombinantly produced enzymes in a suitable buffer, or in animals, 
including human clinical trials (see, e.g., Burchell et ai. (1995) Life Sci. 57:1819-1831, 
specifically incorporated herein by reference. Where genetically modified cells are used, 
since most cell lines do not express UGT1 activity (liver cells lines being the exception), 
introduction of artificial construct for expression of the UGT1 polymorphism into many 
human and non-human cell lines does not require additional modification of the host to 
inactivate endogenous UGT1 expression/activity. Clinical trials may monitor serum, urine, 
etc. levels of the substrate or its metabolite(s). 

Typically a candidate substrate is input into the assay system, and the conversion to 
a metabolite is measured over time. The choice of detection system is determined by the 
substrate and the specific assay parameters. Assays are conventionally run, and will 
include negative and positive controls, varying concentrations of substrate and enzyme, etc. 

Genotyping: UGT1 genotyping is performed by DNA or RNA sequence and/or 
hybridization analysis of any convenient sample from a patient, e.g. biopsy material, blood 
sample (serum, plasma, etc.), buccal cell sample, etc. A nucleic acid sample from an 
individual is analyzed for the presence of polymorphisms in UGT1, particularly those that 
affect the activity or expression of UGT1. Specific sequences of interest include any 
polymorphism that leads to changes in basal expression in one or more tissues, to changes 
in the modulation of UGT1 expression by modifiers, or alterations in UGT1 substrate 
specificity and/or activity. 
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Linkage Analysis: Diagnostic screening may be performed for polymorphisms that 
are genetically linked to a phenotypic variant in UGT1 activity or expression, particularly 
through the use of microsatellite markers or single nucleotide polymorphisms (SNP). The 
microsatellite or SNP polymorphism itself may not phenotypically expressed, but is linked to 
5 sequences that result in altered activity or expression. Two polymorphic variants may be in 
linkage disequilibrium, i.e. where alleles show non-random associations between genes 
even though individual loci are in Hardy-Weinberg equilibrium. 

Linkage analysis may be performed alone, or in combination with direct detection of 
phenotypically evident polymorphisms. The use of microsatellite markers for genotyping is 
10 well documented. For examples, see Mansfield et ai. (1994) Genomics 24:225-233; and 
Ziegle et al. (1992) Genomics 14:1026-1031. The use of SNPs for genotyping is illustrated 
in Underhill et al. (1996) Proc Natl Acad Sci U S A 93:196-200. 

Transgenic animals. The subject nucleic acids can be used to generate genetically 
15 modified non-human animals or site specific gene modifications in ceil lines. The term 
"transgenic" is intended to encompass genetically modified animals having a deletion or 
other knock-out of UGT1 gene activity, having an exogenous UGT1 gene that is stably 
transmitted in the host cells, or having an exogenous UGT1 promoter operably linked to a 
reporter gene. Transgenic animals may be made through homologous recombination, 
20 where the UGT1 locus is altered. Alternatively, a nucleic acid construct is randomly 

integrated into the genome. Vectors for stable integration include plasmids, retroviruses 
and other animal viruses, YACs, and the like. Of interest are transgenic mammals, e.g. 
cows, pigs, goats, horses, etc., and particularly rodents, e.g. rats, mice, etc. 

25 Genetically Modified Cells. Primary or cloned cells and cell lines are modified by the 

introduction of vectors comprising UGT1 gene polymorphisms. The gene may comprise 
one or more variant sequences, preferably a haplotype of commonly occurring 
combinations. In one embodiment of the invention, a panel of two or more genetically 
modified cell lines, each cell line comprising a UGT2B4 polymorphism, are provided for 

30 substrate and/or expression assays. The panel may further comprise cells genetically 
modified with other genetic sequences, including polymorphisms, particularly other 
sequences of interest for pharmacogenetic screening, e.g. UGT1, other UGT2 sequences, 
cytochrome oxidase polymorphisms, etc. 

Vectors useful for introduction of the gene include plasmids and viral vectors, e.g. 

35 retroviral-based vectors, adenovirus vectors, etc. that are maintained transiently or stably in 
mammalian cells. A wide variety of vectors can be employed for transfection and/or 
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integration of the gene into the genome of the cells. Alternatively, micro-injection may be 
employed, fusion, or the like for introduction of genes into a suitable host cell. 



Genotvpina Methods 
The effect of a polymorphism in the UGT1 gene sequence on the response to a 
particular substrate or modifier of UGT1 is determined by in vitro or in vivo assays. Such 
assays may include monitoring the metabolism of a substrate during clinical trials to 
determine the UGT1 enzymatic activity, specificity or expression level. Generally, in vitro 
assays are useful in determining the direct effect of a particular polymorphism, while clinical 
studies will also detect an enzyme phenotype that is genetically linked to a polymorphism. 

The response of an individual to the substrate or modifier can then be predicted by 
determining the UGT1 genotype, with respect to the polymorphism. Where there is a 
differential distribution of a polymorphism by racial background, guidelines for drug 
administration can be generally tailored to a particular ethnic group. 

The basal expression level in different tissue may be determined by analysis of 
tissue samples from individuals typed for the presence or absence of a specific 
polymorphism. Any convenient method may be use, e.g. ELISA, RIA, efc. for protein 
quantitation, northern blot or other hybridization analysis, quantitative RT-PCR, eta for 
mRNA quantitation. The tissue specific expression is correlated with the genotype. 

The alteration of UGT1 expression in response to a modifier is determined by 
administering or combining the candidate modifier with an expression system, e.g. animal, 
cell, in vitro transcription assay, etc. The effect of the modifier on UGT1 transcription 
and/or steady state mRNA levels is determined. As with the basal expression levels, tissue 
specific interactions are of interest. Correlations are made between the ability of an 
expression modifier to affect UGT1 activity, and the presence of the provided 
polymorphisms. A panel of different modifiers, cell types, etc. may be screened in order to 
determine the effect under a number of different conditions. 

A UGT1 polymorphism that results in altered enzyme activity or specificity is 
determined by a variety of assays known in the art. The enzyme may be tested for 
metabolism of a substrate in vitro, for example in defined buffer, or in cell or subcellular 
lysates, where the ability of a substrate to be metabolized by UGT1 under physiologic 
conditions is determined. Where there are not significant issues of toxicity from the 
substrate or metabolite(s), in vivo human trials may be utilized, as previously described. 

The genotype of an individual is determined with respect to the provided UGT1 gene 
polymorphisms. The genotype is useful for determining the presence of a phenotypically 
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evident polymorphism, and for determining the linkage of a polymorphism to phenotypic 
change. 

A number of methods are available for analyzing nucleic acids for the presence of a 
specific sequence. Where large amounts of DNA are available, genomic DNA is used 
5 directly. Alternatively, the region of interest is cloned into a suitable vector and grown in 
sufficient quantity for analysis. The nucleic acid may be amplified by conventional 
techniques, such as the polymerase chain reaction (PCR), to provide sufficient amounts for 
analysis. The use of the polymerase chain reaction is described in Saiki et al. (1985) 
Science 230: 1 350-1 354 T and a review of current techniques may be found in Sambrook et 

10 a/. Molecular Cloning: A Laboratory Manual, CSH Press 1989, pp.14.2-14.33. 

Amplification may be used to determine whether a polymorphism is present, by using a 
primer that is specific for the polymorphism. Alternatively, various methods are known in 
the art that utilize oligonucleotide ligation as a means of detecting polymorphisms, for 
examples see Riley et al. (1990) Nucleic Acids Res 18:2887-2890; and Delahunty et al. 

15 (1 996) Am J Hum Genet 58: 1 239-1 246. 

A detectable label may be included in an amplification reaction. Suitable labels 
include fluorochromes, e.g. fluorescein isothiocyanate (FITC), rhodamine, Texas Red, 
phycoerythrin, allophycocyanin, 6-carboxyfluorescein (6-FAM), 2\7'-dimethoxy-4\5'- 
dichloro-6-carboxyfluorescein (JOE), 6-carboxy-X-rhodamine (ROX), 6-carboxy-2\4\7',4,7- 

20 hexachlorofluorescein (HEX), 5-carboxyfluorescein (5-FAM) or N,N,N\N'-tetramethyl-6- 
carboxyrhodamine (TAMRA), radioactive labels, e.g. 32P, 35S, 3H; etc. The label may be 
a two stage system, where the amplified DNA is conjugated to biotin, haptens, etc. having a 
high affinity binding partner, e.g. avidin, specific antibodies, etc., where the binding partner 
is conjugated to a detectable label. The label may be conjugated to one or both of the 

25 primers. Alternatively, the pool of nucleotides used in the amplification is labeled, so as to 
incorporate the label into the amplification product. 

The sample nucleic acid, e.g. amplified or cloned fragment, is analyzed by one of a 
number of methods known in the art. The nucleic acid may be sequenced by dideoxy or 
other methods. Hybridization with the variant sequence may also be used to determine its 

30 presence, by Southern blots, dot blots, etc. The hybridization pattern of a control and 
variant sequence to an array of oligonucleotide probes immobilized on a solid support, as 
described in U.S. 5,445,934, or in WO95/35505, may also be used as a means of detecting 
the presence of variant sequences. Single strand conformational polymorphism (SSCP) 
analysis, denaturing gradient gel electrophoresis (DGGE), mismatch cleavage detection, 

35 and heteroduplex analysis in gel matrices are used to detect conformational changes 

created by DNA sequence variation as alterations in electrophoretic mobility. Alternatively, 
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where a polymorphism creates or destroys a recognition site for a restriction endonuclease 
(restriction fragment length polymorphism, RFLP), the sample is digested with that 
endonuclease, and the products size fractionated to determine whether the fragment was 
digested. Fractionation is performed by gel or capillary electrophoresis, particularly 
acrylamide or agarose gels. 

In one embodiment of the invention, an array of oligonucleotides are provided, 
where discrete positions on the array are complementary to one or more of the provided 
polymorphic sequences, e.g. oligonucleotides of at least 12 nt, frequently 20 nt, or larger, 
and including the sequence flanking the polymorphic position. Such an array may comprise 
a series of oligonucleotides, each of which can specifically hybridize to a different 
polymorphism. For examples of arrays, see Hacia et al. (1 996) Nat Genet 14:441-447 and 
DeRisi et al. (1996) Nat Genet 14:457-460. Arrays of interest may further comprise 
sequences, including polymorphisms, of other genetic sequences, particularly other 
sequences of interest for pharmacogenetic screening, e.g. UGT1, other UGT2 sequences, 
cytochrome oxidase polymorphisms, etc. 

The genotype information is used to predict the response of the individual to a 
particular UGT1 substrate or modifier. Where an expression modifier inhibits UGT1 
expression, then drugs that are a UGT1 substrate will be metabolized more slowly if the 
modifier is co-administered. Where an expression modifier induces UGT1 expression, a 
co-administered substrate will typically be metabolized more rapidly. Similarly, changes in 
UGT1 activity will affect the metabolism of an administered drug. The pharmacokinetic 
effect of the interaction will depend on the metabolite that is produced, e.g. a prodrug is 
metabolized to an active form, a drug is metabolized to an inactive form, an environmental 
compound is metabolized to a toxin, etc. Consideration is given to the route of 
administration, drug-drug interactions, drug dosage, etc. 

EXAMPLES 

The following examples are put forth so as to provide those of ordinary skill in the 
art with a complete disclosure and description of how to make and use the subject 
invention, and are not intended to limit the scope of what is regarded as the invention. 
Efforts have been made to ensure accuracy with respect to the numbers used (e.g., 
amounts, temperature, concentrations, etc.) but some experimental errors and deviations 
should be allowed for. Unless otherwise indicated, parts are parts by weight, molecular 
weight is average molecular weight, temperature is in degrees centigrade; and pressure is 
at or near atmospheric. 
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Example: Identification of UGT1 Polymorphisms 
MATERIALS AND METHODS 

DNA Samples. Blood specimens were collected from approximately 48 individuals 
after obtaining informed consent. All samples were stripped of personal identifiers to 
maintain confidentiality. Genomic DNA was isolated from these samples using standard 
techniques. Genomic DNA was stored either as a concentrated solution, or in a dried form 
in microtiter plates. 

PCR amplifications. The primers used to amplify all exons are shown in Table 2, and 
were designed with NBI's Oligo version 5.0 program. 
Table 2. PCR Primers. (Ex = Exon) 



PRIMARY PCR AMPLIFICATION 

EX FORWARD PRIMER 

TGGTGTATCGATTGGTTTT (SEQ ID 
NO: 19) 

ACAAGGTAATTAAGATGAAGAAAGCA 
(SEQIDNO:21) . 

TTTGTCTTCCA ATTA CATG C (SEQ ID 
NO:23) 

TCTCAGTGACAAGGTAATTAAGAC(SE 
Q ID NO:25) 

AATTTGGGTTCTTACATATCAA(SEQ ID 
NO:27) 

ATAAGTACACGCCTTCTTTTG (SEQ ID 
NO:29) 

CGCCTACGTATCATAGCAGTTA(SEQ ID 
NO:31) 

TCTTTCCGCCTACTGTATCA (SEQ ID 
NO:33) 



1A 
1C 
ID 
IE 
IF 
1G 
1H 
1J 



REVERSE PRIMER 

CATATATCTGGGGCTAGTTAATC (SEQ ID 
NO:20) 

ACCTGAGATAGTGGCTTCCT (SEQ ID 
NO:22) 

AGTAGATATGGAAGCACTTGTAAG (SEQ 
ID NO:24) 

CATTGATTGGATAAAGGCA (SEQ ID NO:26) 

GAGTGAGGGAGGACAGAG (SEQ ID NO:28) 

GCTGCTTTATACAATTTGCTAC (SEQ ID 
NO:30) 

GGAAAGAAATTTGAAATGCAAC (SEQ ID 
NO:32) 

TTCAAGA AGGGCAGTTTTAT (SEQ ID 
NO:34) 



SECONDARY PCR AMPLIFICATION 
EX FORWARD PRIMER 

1 A CTCTGGCAGGAGCA AAG (SEQ ID 
NO:35) 

1C GGTAATTAAGATGAAGAAAGCA(SEQ 
IDNO:37) 

ID GTGGCTCAATGACAAGG (SEQ ID 
NO:39) 

IE TTAAGACGAAGGAAACAATTCT(SEQ 
IDNO:41) 

IF ATCAAAGGGTAAAATTCAGA (SEQ ID 
NO:43) 

1G TTTTGAGGGCAGGTTCTA (SEQ ID 
NO:45) 

1 H TTCTCTCATGGCTCGCA (SEQ ID NO:47) 

1 J CCGCCTACTGTATCATAGC A (SEQ ID 
NO:49) 



REVERSE PRIMER 

ATACACACCTGGGATAGTGG (SEQ ID 
NO:36) 

CTGAGATAGTGGCTTCCTG (SEQ ID NO:38) 

ATATGGAAGCACTTGTAAGTAAA(SEQ ID 
NO:40) 

ACCTGAGATAGTGGCTTCC (SEQ ID NO:42) 

GGCAGTCCAAAAGAAATA (SEQ ID NO:44) 

AATGGGACAAATGTAAATGATA (SEQ ID 
NO:46) 

ATGTCAAATCACAATTCAGTAAGG (SEQ ID 
NO:48) 

CAACGAAATGTCAAATCACAG (SEQ ID 
NO:50) 
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Publicly available genomic sequences were used as references. Twenty-five 
nanograms of genomic DNA were amplified in the primary amplifications using the Perkin 
Elmer GeneAmp PCR kit according to the manufacturer's instructions in 25 jjI reactions with 
AmpliTaq Gold DNA polymerase. Reactions contained 25 mM MgCI2 and 0.2 pM of each 
5 primer. Thermal cycling was performed using a GeneAmp PCR System 9600 PCR machine 
(Perkin Elmer), utilizing a touch-down PCR protocol. The protocol, unless indicated otherwise 
in Table 3, consisted of an initial incubation of 95 °C for 10 min, followed by eight cycles of 
95°C for 20 sec, 66°C (minus 1 °C per cycle) for 15 sec, 72*C for 2 min, and twenty-seven 
cycles of 95°C for 20 sec, 54°C for 15 sec, 72°C for 2 min, and one final extension step of 
10 72°Cfor10min. 

For the secondary PCR reactions, one microliter of each primary PCR reaction was re- 
amplified using the secondary PCR primers, also listed in Table 2. The thermal cycling profile 
that was used for the primary PCR for an exon was used for the secondary PCR. 

1 5 Table 3. Cycling Profile Modifications 



Exon 


Primary PCR 


Secondary PCR 


IE 


Touch-Down PCR step: 8 cycles 

64 C (minus 1 C per cycle), for 15 sec 

Total Number of cycles: 35 


same as Primary PCR 


IF 


Touch-Down PCR step: 10 cycles 
64 C (minus 1 C per cycle), for 15 sec 
Total Number of cycles: 35 


same as Primary PCR 


1G 


Touch-Down PCR step: 7 cycles 

64 C (minus 1 C per cycle), for 15 sec 

Total Number of cycles: 35 


same as Primary PCR 


1H 


Touch-Down PCR step: 10 cycles 
66 C (minus 1 C per cycle), for 15 sec 
Total Number of cycles: 35 


same as Primary PCR 



30 DNA sequencing. PCR products from 48 individuals, approximately one-third 

representing each of the 3 major racial groups (see above), were prepared for sequencing 
by treating 8 pL of each PCR product with 0.15 uL of exonuclease I (1.5 U/reaction), 0.3 uL 
of Shrimp Alkaline Phosphatase (0.3 U/reaction), q.s. to 10 pL with MilliQ water, and 
incubated at 37°C for 15 min, followed by 72°C for 15 min. Cycle sequencing was 

35 performed on the GeneAmp PCR System 9600 PCR machine (Perkin Elmer) using the ABI 
Prism dRhodamine Terminator Cycle Sequencing Ready Reaction Kit according to the 
manufacturer's directions, with the following changes: (1) 2 pL of dRhodamine terminator 
premix, instead of 8 uL; and (2) 10% (v/v) Dimethylsulfoxide was added to each individual 
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nucleotide. The oligonucleotide primers (unlabeled), at 3 picomoles per reaction, used for 
the sequencing reactions are listed in Table 4. Sequencing reactions, with a final volume of 
5 pL, were subjected to 30 cycles at 96°C for 20 sec, 50°C for 5 sec, and 60°C for 4 min, 
followed by ethanol precipitation. After decanting the ethanol, samples were evaporated to 
5 dryness using a SpeedVac for roughly 1 5 min and were resuspended in 2 pi of loading 
buffer (5:1 deionized formamide:50 mM EDTA pH 8.0). The samples were then, heated to 
94°C for 2 min, and electrophoresed through 5.25% polyacrylamide/6M urea gels in an ABI 
Prism 377 DNA Sequencer according to the manufacturer's instructions for sequence 
determination. All sequences were determined from both the 5' and 3' (sense and 
10 antisense) direction. 

Of the forty-eight samples, 38 polymorphisms were identified. The polymorphisms 
are described in Table 5 below. 
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Table 4. Sequencing Primers (No. = Polymorphism No.) 

No. FORWARD PRIMER REVERSE PRIMER 

CTCTGGCAGGAGCAAAG (SEQ IDNO:51) ACAGTGGGCAGAGACAG (SEQ ID 

NO:52) 

GTGGTTTATTCCCCGTAT (SEQ ID NO:53) ATACACACCTGGGATAGTGG (SEQ ID 

NO:54) 

GGTAATTAAGATGAAGAAAGCA (SEQ 
IDNO:55) 



1 

2 

3-5 
6 

7,8 



GAAATGGCATAGGTTGTC (SEQ ID 
NO:56) 



GGCCACACTCAACTGTA (SEQ ID NO:57) CTCAAAAAAAACACAGTAGG (SEQ ID 

NO:58) 

ACTTTTTCTG CCCCTTAT (SEQ ID NO:59) ATATGGAAGCACTTGTAAGTAAA (SEQ 

ID NO:60) 

9- 1 2 TTAAGACGAAGGA AACA ATTCT (SEQ ID AATGGCATACGTTGTCA (SEQ ID NO'62) 
NO:61) V } 



13,14 AGAATGGCAATTATGAACA (SEQ ID 
NO:63) 

15-17 AGAATGGCAATTATGAACA (SEQ ID 
NO:65) 

1 8-24 CTCTGGC 1 CTGTCCTAC* (SEQ ID 
NO:67) 

25 ATCAAAGGGTAAAATTCAGA (SEQ ID 
NO:69) 

26 AATTTGCTTTTGAAAGAATC (SEQ ID 
NO:71) 

27,28 AATTTGCTTTTGAAAGAATC (SEQ ID 
NO:73) 

29,30 TTTTGAGGGCAGGTTCTA (SEQ ID 
NO:75) 

3 1 ,32 TTGCAGGAGTTTGTTTAAT (SEQ ID 
NO:77) 

33 CATTGCAGGAGTTTGTTTA (SEQ ID 
NO:79) 

34 AGAAATAGCCTCTGAAATTC (SEQ ID 
NO:81) 

35 CCGCCTACTGTATCATAGCA (SEQ ID 
NO:83) 

36-38 ATTTTGCCAGTATCTTTTTAG (SEQ ID 



TGTGTGCCCTTAAAGTCT (SEQ ID 
NO:64) 

ACCTGAGATAGTGGCTTCC (SEQ ID 
NO:66) 

ACCTGAGATAGTGGCTTCC (SEQ ID 
NO:68) 

CAGCAGCTTGTCACCTAC (SEQ ID 
NO:70) 

GGTAGGCCCAAATACTCA (SEQ ID 
NO:72) 

GGCAGTCCAAAAGAAATA (SEQ ID 
NO:74) 

CACCTCTGGCATGACTAC (SEQ ID 
NO:76) 

AATGGGACAAATGTAAATGATA (SEQ 
ID NO:78) 

CATCTGAGAACCCTAAGAGA (SEQ ID 
NO:80) 

ATGTCAAATCACAATTCAGTAAGG(SEQ 
ID NO:82) 

GAGTGTACGAGGTTGAGTAAG (SEQ ID 
NO:84) 

CAACGAAATGTCAAATCACAG (SEQ ID 
NO:86) 



NO:85) 

* Note polymorphism in primer. The reference sequence has a "C" at the highlighted 
position. 
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Table 5. UGT1 polymorphisms. Amino acid changes numbered from first methionine for 
that exon (Ex). 



5 



10 



15 



20 



No 


Ex 


Ntd 


AA 


SEQUENCE (SEQ ID NO:) 


I 


1A 


G227 A 


Gly 71 Arg 


CATCAGAGAC A GAGCATTTTACACCTT(SEQ ID 
NO:87) 


7 
z 


1 A 


T 7A^ C 


Oa.*< oci 

oer Z3 1 Pro 


GGACCTATTGAGC C CTGC ATCTGTCT (SEQ ID NO:88) 


3 


1C 


T75C 


Trp 1 1 Arg 


GGTTCCCCTGCCG C GGCTGGCCACA (SEQ ID NO:89) 


4 


1C 


G 125 A 




GCCCTGGGCTGA A AGTGGAAAG (SEQ ID NO:90) 


5 


1C 


T 184 C 


Val 47 Ala 


ATGCGGGAGG C CTTGCGGGAGCT (SEQ ID NO:91) 


6 


1C 


A 521 G 




CTCTGCGCGGC G GTGCTGGCTAAG (SEQ ID NO:92) 


7 


ID 


G 848 A 




TACCCCAGGCC A ATC ATGCCCAACA (SEQ ID NO:93) 


8 


ID 


C43T 


Intronic 


TCCAGGCAAAA T ACTTTTTA AAAAATG(SEQ ID 
NO:94) 


9 


IE 


T187C 


Leu 48 Ser 


AGCATGCGGGAGGCCT C GCGGGA (SEQ ID NO:95) 


10 


IE 


C 194 G 


Asp 58 Glu 


GCGGGA G CTCCATGCGAGAGG (SEQ ID NO:96) 


11 


IE 


T232 C 


Leu 63 Pro 


TGGTGGTCCTCACCC C GGAGGTGAA (SEQ ID NO:97) 


12 


IE 


A257G 




TACATCAAAGA G GAGAACTTTTTCAC (SEQ ID 
NO:98) 


I - * 
i j 


it, 


L/ 40o A 


His 142 Asn 


TGATCAGGCACCTG A ATGCTACTTCC (SEQ ID NO:99) 


\A 
14 


1 P 

iJb 


L Jl / u 


Al n IfO PI.. 

Ala 158 Gly 


ACCTCTGCG G GGCGGTGCTGG (SEQ ID NO: 1 00) 


i J 


IP 


r* ACQ T 




AAGAACATGCT T TACCCTCTGGC (SEQ ID NO: 101) 


1£ 
1 0 


1 p 

IE. 


C 7ft1 T 
I /l/l J 




CTCTGGC T CTGTCCTACC (SEQ ID NO: 102) 


1 7 


1 P 


VW 1 / 1 




TCCTACC1 1 1GC T ATGCTGTTTCT (SEQ ID NO: 103) 


I o 


1 P 
111 


L. /50 A 


Leu 248 lie 


TGTCAGTGGTGGAT A TT (SEQ ID NO: 104) 


10 


1 P 

1 Jc 


n 7co 
vj /sy c 


Val 24y Leu 


GGTGGAT A TT C^TCAGC (SEQ ID NO: 105) 


zu 


1 p 
It 


O 7QC T 


T_T* Af» T_ 

His 25 1 Tyr 


TCAGC T ATGCATC (SEQ ID NO: 106) 


Z 1 


1 p 

IE, 


T 

1 o\jo L, 


C*«- 7C7 DU™ 

ber 253 Pne 


GCATC C GTGTGGCTGTTCCGA (SEQ ID NO: 1 07) 


99 

zz 


1 P 

IE, 


U ol? t 


Oly 259 Arg 


TGGCTGTTCCGA C.GGGACTT (SEQ ID NO: 108) 


91 
Zj 


1 P 


1 OZ / 




GGGACTT C GTGATGGA (SEQ ID NO: 109) 


94 

ZH 


IP 

IE, 






GTGATGGA C TACCCCAGGCCGAT (SEQ ID NO:l 10) 


9^ 

Z«> 


IP 


1 101 vj 


oer / Ala 


CCTGCCTCCTTCGC G CATTTCAGAG (SEQ ID NO:l 1 1) 


9£ 
ZO 


Ir 


A 4 j / vj 




GCGATCATTCCT G ACTGCTCCTC AG (SEQ ID NO: 1 12) 


97 
z / 


1 P 
ir 


A 05.5 vj 


inr l5l Ala 


CCCTGGAGCAT G CATTCAGCAG (SEQ ID NO:l 13) 


9R 
Zo 


1 P 
ir 


a oy4 v^ 


A «-» 1 OA O 

Arg 1 84 ber 


CATTCAGCAG C AGCCCAGACCCT (SEQ ID NO:l 14) 


90 


i n 

1 VJ 


1 JJ VJ 




TACTTCTTCCAC g. TACTATATTA (SEQ ID NO: 1 15) 




I VJ 


C 194 A 
V^ i Z4 A 




GGCCTCCTTCC A CTATATGTGTGT (SEQ ID NO:l 16) 


^1 
_> 1 


in 


T 719 C 
1 / 1Z V* 


T»-*> one A 

1 rp zUd Arg 


GGAGAGAGTA C GGAACCACAT (SEQ ID NO:l 17) 


Jl- 


in 

1 VJ 


n R4£ A 
VJ OHO A 




TCAATTTGGTT A, TTGCGAACTGA (SEQ ID NO: 1 1 8) 






G 518 C 


Gly 173 Ala 


CAGGGGAATAG C TTGCCACTAT (SEQ ID NO: 1 1 9) 


^4 
jf 


1H 


A765G 




TGTTGCGAAC G GACTTTGTTTTGG (SEQ ID NO: 120) 


35 


U 


G 127 A 




TTCACCAGCA A TCGGTGGTGG (SEQ ID NO:121) 


36 


1J 


C694T 




CTAGAAATAGC T TCTG AAATTCTCC (SEQ ID NO: 122) 


37 


1J 


C 731 A 


Leu 244 He 


CGGCATATGAT A TCTACAGTCACA (SEQ ID NO: 123) 


38 


1J 


T761 C 


Arg 254 Stop 


TCAATTTGGTTG C TGCGAACAGGAC (SEQ ID NO: 124) 




The asterisk associated with the second nucleotide residue in polymorphism no. 19 



is in the sequence surrounding the newly discovered polymorphism at residue 789 
(nucleotide change from C at residue 786 to A). 
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All publications and patent applications cited in this specification are herein 
incorporated by reference as if each individual publication or patent application were 
specifically and individually indicated to be incorporated by reference. The citation of any 
publication is for its disclosure prior to the filing date and should not be construed as an 
5 admission that the present invention is not entitled to antedate such publication by virtue of 
prior invention. 

Although the foregoing invention has been described in some detail by way of 
illustration and example for purposes of clarity of understanding, it will be readily apparent 
to those of ordinary skill in the art in light of the teachings of this invention that certain 
10 changes and modifications may be made thereto without departing from the spirit or scope 
of the appended claims. 
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1. An isolated nucleic acid molecule comprising a UGT1 sequence 
polymorphism of SEQ ID NOS: 87-124, as part of other than a naturally occurring 

5 chromosome. 

2. A nucleic acid probe for detection of UGT1 locus polymorphisms, comprising 
a polymorphic sequence of SEQ ID NOS:87-124. 

10 3. A nucleic acid probe according to Claim 2, wherein said probe is conjugated 

to a detectable marker. 

4. An array of oligonucleotides comprising: 

two or more probes for detection of UGT1 locus polymorphisms, said probes 
15 comprising at least one form of a polymorphic sequences of SEQ ID NOS:87-124. 

5. A method for detecting in an individual a polymorphism in UGT1 metabolism 
of a substrate, the method comprising: 

analyzing the genome of said individual for the presence of at least one UGT1 
20 polymorphism of SEQ ID NOS:87-1 24; wherein the presence of said predisposing 
polymorphism is indicative of an alteration in UGT1 expression or activity. 

6. A method according to Claim 5, wherein said analyzing step comprises 
detection of specific binding between the genomic DNA of said individual with an array of 

25 oligonucleotides comprising: 

two or more probes for detection of UGT1 locus polymorphisms, said probes 
comprising at least one form of a polymorphic sequence of SEQ ID NOS:87-124. 

7. A method according to Claim 5, wherein said alteration in UGT1 expression 
30 is tissue specific. 

8. A method according to Claim 5, wherein said alteration in UGT1 expression 
is in response to a UGT1 modifier. 

35 9. A method according to Claim 8, wherein said modifier induces UGT1 

expression. 
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10. A method according to Claim 8, wherein said modifier inhibits UGT1 
expression. 
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SEQUENCE LISTING 



<110> Penny, Laura 

Galvin, Margaret 



<120> Genotyping the Human 

UDP-Glucuronosyltransf erase 1 (UGT1) Gene 



<130> SEQ-17WO 

<140> Unassigned 
<141> 1999-05-04 

<150> 60/084,807 
<151> 1998-05-07 

<160> 124 

<170> FastSEQ for Windows Version 3.0 

<210> 1 

<211> 864 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> (1). 



. (864) 



<400> 1 

atg get gtg gag tec cag ggc gga cgc cca ctt gtc ctg ggc ctg ctg 4 8 

Met Ala Val Glu Ser Gin Gly Gly Arg Pro Leu Val Leu Gly Leu Leu 
15 10 15 

ctg tgt gtg ctg ggc cca gtg gtg tec cat get ggg aag ata ctg ttg 96 
Leu Cys Val Leu Gly Pro Val Val Ser His Ala Gly Lys He Leu Leu 
20 25 ^ ' 30 

ate cca gtg gat ggc age cac tgg ctg age atg ctt ggg gee ate cag 144 
He Pro Val Asp Gly Ser His Trp Leu Ser Met Leu Gly Ala He Gin 
35 40 45 

cag ctg cag cag agg gga cat gaa ata gtt gtc eta gca cct gac gec 192 
Gin Leu Gin Gin Arg Gly His Glu He Val Val Leu Ala Pro Asp Ala 
50 55 60 

teg ttg tac ate aga gac gga gca ttt tac acc ttg aag acg tac cct 24 0 

Ser Leu Tyr He Arg Asp Gly Ala Phe Tyr Thr Leu Lys Thr Tyr Pro 
65 70 75 80 

gtg cca ttc caa agg gag gat gtg aaa gag tct ttt gtt agt etc ggg 288 
Val Pro Phe Gin Arg Glu Asp Val Lys Glu Ser Phe Val Ser Leu Gly 
85 90 95 

cat aat gtt ttt gag aat gat tct ttc ctg cag cgt gtg ate aaa aca 336 
His Asn Val Phe Glu Asn Asp Ser Phe Leu Gin Arg Val He Lys Thr 
100 105 110 

tac aag aaa ata aaa aag gac tct get atg ctt ttg tct ggc tgt tec 384 
Tyr Lys Lys He Lys Lys Asp Ser Ala Met Leu Leu Ser Gly Cys Ser 
115 120 125 
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cac tta ctg cac aac aag gag etc atg gec tec ctg gca gaa age age 432 
His Leu Leu His Asn Lys Glu Leu Met Ala Ser Leu Ala Glu Ser Ser 
130 135 140 

ttt gat gtc atg ctg acg gac cct ttc ctt cct tgc age ccc ate gtg 4 80 

Phe Asp Val Met Leu Thr Asp Pro Phe Leu Pro Cys Ser Pro He Val 
145 150 155 i 60 

gee cag tac ctg tct ctg ccc act gta ttc ttc ttg cat gca ctg cca 528 
Ala Gin Tyr Leu Ser Leu Pro Thr Val Phe Phe Leu His Ala Leu Pro 
!65 170 175 

tgc age ctg gaa ttt gag get acc cag tgc ccc aac cca ttc tec tac 576 
Cys Ser Leu Glu Phe Glu Ala Thr Gin Cys Pro Asn Pro Phe Ser Tyr 
180 185 190 

gtg ccc agg cct etc tec tct cat tea gat cac atg acc ttc ctg cag 624 
Val Pro Arg Pro Leu Ser Ser His Ser Asp His Met Thr Phe Leu Gin 
195 200 205 

egg gtg aag aac atg etc att gee ttt tea cag aac ttt ctg tgc gac 672 
Arg Val Lys Asn Met Leu He Ala Phe Ser Gin Asn Phe Leu Cys Asp 
210 215 220 

gtg gtt tat tec ccg tat gca acc ctt gee tea gaa ttc ctt cag aga 720 
Val Val Tyr Ser Pro Tyr Ala Thr Leu Ala Ser Glu Phe Leu Gin Ara 
225 230 235 240 

gag gtg act gtc cag gac eta ttg age tct gca tct gtc tgg ctg ttt 768 
Glu Val Thr Val Gin Asp Leu Leu Ser Ser Ala Ser Val Trp Leu Phe 
245 250 255 

aga agt gac ttt gtg aag gat tac cct agg ccc ate atg ccc aat atg 816 
Arg Ser Asp Phe Val Lys Asp Tyr Pro Arg Pro He Met Pro Asn Met 
260 265 270 

gtt ttt gtt ggt gga ate aac tgc ctt cac caa aat cca eta tec cag 864 
Val Phe Val Gly Gly He Asn Cys Leu His Gin Asn Pro Leu Ser Gin 
275 280 285 



<210> 2 
<211> 288 
<212> PRT 

<213> Homo sapiens 



<400> 2 



Met 
1 


Ala 


Val 


Glu 


Ser Gin Gly Gly 
5 


Leu Cys 


Val 


Leu 


Gly Pro Val Val 








20 




He 


Pro 


Val 


Asp 


Gly Ser His Trp 






35 




40 


Gin 


Leu 


Gin 


Gin 


Arg Gly His Glu 




50 






55 


Ser 


Leu 


Tyr 


He 


Arg Asp Gly Ala 


65 








70 


Val 


Pro 


Phe 


Gin 


Arg Glu Asp Val 










85 


His 


Asn 


Val 


Phe 


Glu Asn Asp Ser 








100 


Tyr 


Lys 


Lys 


He 


Lys Lys Asp Ser 






115 




120 



Arg Pro Leu Val Leu Gly Leu Leu 

10 15 
Ser His Ala Gly Lys lie Leu Leu 
25 30 
Leu Ser Met Leu Gly Ala He Gin 
45 

He Val Val Leu Ala Pro Asp Ala 
60 

Phe Tyr Thr Leu Lys Thr Tyr Pro 

75 80 
Lys Glu Ser Phe Val Ser Leu Gly 

90 95 
Phe Leu Gin Arg Val He Lys Thr 
105 HO 
Ala Met Leu Leu Ser Gly Cys Ser 
125 
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His 


Leu 


Leu 


His 


Asn 


Lys 


Glu 


Leu 


Met 


Al » 

r\J- a 


oer 


Leu 


7\ 1 -> 

/\i a 


blU 


Ser 


Ser 




130 










135 


















Phe 


Asp 


Val 


Met 


Leu 


Thr 


ASD 


Pro 


Phe 


T .on 


p y~ r% 




Q £S V 


Pro 


Tl - 

lie 


val 


145 










150 










iJ J 








1 o(J 


Ala 


Gin 


Tyr 


Leu 


Ser 
165 


Leu 


Pro 


Thr 


Val 


c lit; 

170 


tr l It: 


Lgu 


fllS 


Aia 


Leu 
175 


Pro 


Cys 


Ser 


Leu 


Glu 


Phe 


Glu 


Ala 


Thr 


Gin 


vyo 




noil 


ir ro 


PK q 

rne 


ber 


Tyr 








180 










185 










t on 

1 :?U 




Val 


Pro 


Arq 


Pro 


Leu 


Ser 


Ser 


His 


Ser 


Acn 
nop 


Pic: 


Uc L 


i nr 


rne 


Leu 


bin 






195 










200 








J 








Arg 


Val 


Lys 


Asn 


Met 


Leu 


He 


Ala 


Phe 


Ser 


Gin 


A en 
noli 


IT Xltr 


Leu 


Cys 


Asp 




210 










215 
















Val 


Val 


Tvr 


Ser 


Pro 


Tyr Ala 


Thr 


Leu 


Ala 


OtiX 


Pit! 
b-LU 


rne 


Leu 


bin 


Arg 


225 










230 










235 










240 


Glu 


Val 


Thr 


Val 


Gin 


Asp 


Leu 


Leu 


Ser 


Ser 


Ala 


Ser 


Ua 1 
v a± 


irp 


Leu 


irne 










245 










250 








255 




Arg 


Ser 


Asp 


Phe 


Val 


Lys 


Asp 


Tyr 


Pro Arg 


Pro 


He 


Met 


Pro 


Asn 


Met 








260 










.265 










270 






Val 


Phe 


Val 
275 


Gly 


Gly 


He 


Asn 


Cys 
280 


Leu 


His 


Gin 


Asn 


Pro 
285 


Leu 


Ser 


Gin 



<210> 3 

<211> 867 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . . (867) 

<400> 3 

atg gec aca gga etc cag gtt ccc ctg ccg tgg ctg gec aca gga ctg 48 

Met Ala Thr Gly Leu Gin Val Pro Leu Pro Trp Leu Ala Thr Gly Leu 

15 10 15 

ctg ctt etc etc agt gtc cag ccc tgg get gag agt gga aag gtg ttg 96 
Leu Leu Leu Leu Ser Val Gin Pro Trp Ala Glu Ser Gly Lys Val Leu 
20 25 30 

gtg gtg ccc att gat ggc age cac tgg etc age atg egg gag gtc ttg 144 
Val Val Pro He Asp Gly Ser His Trp Leu Ser Met Arg Glu Val Leu 
35* ' 40 45 

egg gag etc cat gec aga ggc cac cag gca gtg gtc etc acc cca gag 192 
Arg Glu Leu His Ala Arg Gly His Gin Ala Val Val Leu Thr Pro Glu 
50 55 60 

gtg aat atg cac ate aaa gaa gag aac ttt ttc acc ctg aca acc tat 240 
Val Asn Met His He Lys Glu Glu Asn Phe Phe Thr Leu Thr Thr Tyr 
65 70 75 80 

gec att teg tgg acc cag gat gaa ttt gat cgc cat gtg ctg ggc cac 288 
Ala He Ser Trp Thr Gin Asp Glu Phe Asp Arg His Val Leu Gly His 
85 90 95 

act caa ctg tac ttt gaa aca gaa cat ttt ctg aag aaa ttt ttc aga 336 
Thr Gin Leu Tyr Phe Glu Thr Glu His Phe Leu Lys Lys Phe Phe Arg 
100 105 " HO 

agt atg gca atg ttg aac aat atg tct ttg gtc tat cat agg tct tgt 384 
Ser Met Ala Met Leu Asn Asn Met Ser Leu Val Tyr His Arg Ser Cys 
115 120 125 
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gtg gag eta eta cat aat gag gec ctg ate agg cac ctg aat get act 432 
Val Glu Leu Leu His Asn Glu Ala Leu He Arg His Leu Asn Ala Thr 
130 135 140 

tec ttt gat gtg gtt tta aca gac ccc gtt aac etc tgc gcg gca gtg 480 
Ser Phe Asp Val Val Leu Thr Asp Pro Val Asn Leu Cys Ala Ala Val 
145 150 155 160 

ctg get aag tac ctg teg att cct act gtg ttt ttt ttg agg aac att 528 
Leu Ala Lys Tyr Leu Ser He Pro Thr Val Phe Phe Leu Arg Asn He 
165 170 175 

cca tgt gat tta gac ttt aag ggc aca cag tgt cca aac cct tec tec 576 
Pro Cys Asp Leu Asp Phe Lys Gly Thr Gin Cys Pro Asn Pro Ser Ser 
180 185 190 

tat att cct aga tta eta aca ace aat tea gac cac atg aca ttc atg 624 
Tyr He Pro Arg Leu Leu Thr Thr Asn Ser Asp His Met Thr Phe Met 
195 200 205 

caa agg gtc aag aac atg etc tac cct ctg gee ctg tec tac att tgc 672 
Gin Arg Val Lys Asn Met Leu Tyr Pro Leu Ala Leu Ser Tyr He Cys 
210 215 220 

cat get ttt tct get cct tat gca age ctt gee tct gag ctt ttt cac 720 
His Ala Phe Ser Ala Pro Tyr Ala Ser Leu Ala Ser Glu Leu Phe Gin 
225 230 235 240 

aga gag gtg tea gtg gtg gat att etc agt cat gca tct gtg tgg ctg 768 
Arg Glu Val Ser Val Val Asp He Leu Ser His Ala Ser Val Trp Leu 
245 250 255 

ttc cga ggg gac ttt gtg atg gac tac ccc agg cca ate atg ccc aac 816 
Phe Arg Gly Asp Phe Val Met Asp Tyr Pro Arg Pro He Met Pro Asn 
260 265 270 

atg gtc ttc att ggg ggc ate aac tgt gec aac agg aag cca eta tct 864 
Met Val Phe He Gly Gly He Asn Cys Ala Asn Arg Lys Pro Leu Ser 
275 280 285 

ca 9 8 67 

Gin 



<210> 4 
<211> 289 
<212> PRT 

<213> Homo sapiens 
<400> 4 

Met Ala Thr Gly Leu Gin Val Pro Leu Pro Trp Leu Ala Thr Gly Leu 

15 10 15 

Leu Leu Leu Leu Ser Val Gin Pro Trp Ala Glu Ser Gly Lys Val Leu 

20 25 30 

Val Val Pro lie Asp Gly Ser His Trp Leu Ser Met Arg Glu Val Leu 

35 40 45 

Arg Glu Leu His Ala Arg Gly His Gin Ala Val Val Leu Thr Pro Glu 

50 55 60 

Val Asn Met His He Lys Glu Glu Asn Phe Phe Thr Leu Thr Thr Tyr 
65 70 75 80 

Ala He Ser Trp Thr Gin Asp Glu Phe Asp Arg His Val Leu Gly His 
85 90 95 
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Thr Gin Leu Tyr Phe Glu Thr Glu His Phe Leu Lys Lys Phe Phe Arg 

100 105 no 

Ser Met Ala Met Leu Asn Asn Met Ser Leu Val Tyr His Arg Ser Cys 

115 120 125 

Val Glu Leu Leu His Asn Glu Ala Leu lie Arg His Leu Asn Ala Thr 

130 135 140 

Ser Phe Asp Val Val Leu Thr Asp Pro Val Asn Leu Cys Ala Ala Val 
145 150 155 160 

Leu Ala Lys Tyr Leu Ser He Pro Thr Val Phe Phe Leu Arg Asn He 

165 170 175 

Pro Cys Asp Leu Asp Phe Lys Gly Thr Gin Cys Pro Asn Pro Ser Ser 

180 185 190 

Tyr He Pro Arg Leu Leu Thr Thr Asn Ser Asp His Met Thr Phe Met 

195 200 205 

Gin Arg Val Lys Asn Met Leu Tyr Pro Leu Ala Leu Ser Tyr He Cys 

210 215 220 

His Ala Phe Ser Ala Pro Tyr Ala Ser Leu Ala Ser Glu Leu Phe Gin 
225 2 30 235 240 

Arg Glu Val Ser Val Val Asp He Leu Ser His Ala Ser Val Trp Leu 

2 ^5 250 255 

Phe Arg Gly Asp Phe Val Met Asp Tyr Pro Arg Pro He Met Pro Asn 

260 265 270 

Met Val Phe He Gly Gly He Asn Cys Ala Asn Arg Lys Pro Leu Ser 
275 280 285 

Gin 



<210> 5 

<211> 867 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . . (867) 

<400> 5 

atg gcc aga gga etc cag gtt ccc ctg ccg egg ctg gee aca gga ctg 4 8 

Met Ala Arg Gly Leu Gin Val Pro Leu Pro Arg Leu Ala Thr Gly Leu 
1 5 10 15 

ctg etc etc etc agt gtc cag ccc tgg get gag agt gga aag' gtg ttg 96 
Leu Leu Leu Leu Ser Val Gin Pro Trp Ala Glu Ser Gly Lys Val Leu 
2 0 25 30 

gtg gtg ccc act gat ggc age ccc tgg etc age atg egg gag gcc ttg 144 
Val Val Pro Thr Asp Gly Ser Pro Trp Leu Ser Met Arg Glu Ala Leu 
35 40 45 

egg gag etc cat gcc aga ggc cac cag gcg gtg gtc etc acc cca gag 192 
Arg Glu Leu His Ala Arg Gly His Gin Ala Val Val Leu Thr Pro Glu 
50 55 60 

gtg aat atg cac ate aaa gaa gag aaa ttt ttc acc ctg aca gcc tat 240 
Val Asn Met His He Lys Glu Glu Lys Phe Phe Thr Leu Thr Ala Tyr 
65 70 75 80 

get gtt cca tgg acc cag aag gaa ttt gat cgc gtt acg ctg ggc tac 288 
Ala Val Pro Trp Thr Gin Lys Glu Phe Asp Arg Val Thr Leu Gly Tyr 
85 90 95 

act caa ggg ttc ttt gaa aca gaa cat ctt ctg aag aga tat tct aga 336 
Thr Gin Gly Phe Phe Glu Thr Glu His Leu Leu Lys Arg Tyr Ser Arg 
100 105 HQ 
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agt atg gca att atg aac aat gta tct ttg gcc ctt cat agg tgt tgt 384 
Ser Met Ala lie Met Asn Asn Val Ser Leu Ala Leu His Arg Cys Cys 
115 120 125 

gtg gag eta ctg cat aat gag gcc ctg ate agg cac ctg aat get act 4 32 

Val Glu Leu Leu His Asn Glu Ala Leu lie Arg His Leu Asn Ala Thr 
130 135 140 

tec ttt gat gtg gtt tta aca gac ccc gtt aac etc tgt ggg gcg gtg 480 
Ser Phe Asp Val Val Leu Thr Asp Pro Val Asn Leu Cys Gly Ala Val 
i4 5 150 155 160 

ctg get aag tac ctg teg att cct get gtg ttt ttt tgg agg tac att 528 
Leu Ala Lys Tyr Leu Ser He Pro Ala Val Phe Phe Trp Arg Tyr He 
165 170 ~ 175 

cca tgt gac tta gac ttt aag ggc aca cag tgt cca aat cct tec tec 576 
Pro Cys Asp Leu Asp Phe Lys Gly Thr Gin Cys Pro Asn Pro Ser Ser 
180 185 190 

tat att cct aag tta eta acg ace aat tea gac cac atg aca ttc ctg 624 
Tyr He Pro Lys Leu Leu Thr Thr Asn Ser Asp His Met Thr Phe Leu 
195 200 205 

caa agg gtc aag aac atg etc tac cct ctg gcc ctg tec tac att tgc 672 
Gin Arg Val Lys Asn Met Leu Tyr Pro Leu Ala Leu Ser Tyr He Cys 
210 215 220 

cat act ttt tct gcc cct tat gca agt ctt gcc tct gag ctt ttt cag 720 
His Thr Phe Ser Ala Pro Tyr Ala Ser Leu Ala Ser Glu Leu Phe Gin 
225 230 235 240 

aga gag gtg tea gtg gtg gat ctt gtc age tat gca tec gtg tgg ctg 768 
Arg Glu Val Ser Val Val Asp Leu Val Ser Tyr Ala Ser Val Trp Leu 
245 250 255 

ttc cga ggg gac ttt gtg atg gac tac ccc agg ccg ate atg ccc aac 816 
Phe Arg Gly Asp Phe Val Met Asp Tyr Pro Arg Pro He Met Pro Asn 
260 265 270 

atg gtc ttc att ggg ggc ate aac tgt gcc aac ggg aag cca eta tct 864 
Met Val Phe He Gly Gly He Asn Cys Ala Asn Gly Lys Pro Leu Ser 
27 5 280 285 



cag 
Gin 



867 



<210> 6 
<211> 289 
<212> PRT 

<213> Homo sapiens 
<400> 6 

Met Ala Arg Gly Leu Gin Val Pro 

1 5 
Leu Leu Leu Leu Ser Val Gin Pro 
20 

Val Val Pro Thr Asp Gly Ser Pro 

35 40 
Arg Glu Leu His Ala Arg Gly His 
50 55 



Leu Pro Arg Leu Ala Thr Gly Leu 

10 15 
Trp Ala Glu Ser Gly Lys Val Leu 
25 30 
Trp Leu Ser Met Arg Glu Ala Leu 
45 

Gin Ala Val Val Leu Thr Pro Glu 
60 
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Val Asn Met His He Lys Glu Glu Lys Phe Phe Thr Leu Thr Ala Tyr 
6 5 70 75 80 

Ala Val Pro Trp Thr Gin Lys Glu Phe Asp Arg Val Thr Leu Gly Tyr 

85 90 95 

Thr Gin Gly Phe Phe Glu Thr Glu His Leu Leu Lys Arg Tyr Ser Arg 

100 105 no 

Ser Met Ala He Met Asn Asn Val Ser Leu Ala Leu His Arq Cys Cys 

115 120 125 

Val Glu Leu Leu His Asn Glu Ala Leu He Arg His Leu Asn Ala Thr 

130 135 140 

Ser Phe Asp Val Val Leu Thr Asp Pro Val Asn Leu Cys Gly Ala Val 
145 150 155 160 

Leu Ala Lys Tyr Leu Ser He Pro Ala Val Phe Phe Trp Arg Tyr lie 

165 170 175 

Pro Cys Asp Leu Asp Phe Lys Gly Thr Gin Cys Pro Asn Pro Ser Ser 

180 185 190 

Tyr He Pro Lys Leu Leu Thr Thr Asn Ser Asp His Met Thr Phe Leu 

195 200 205 

Gin Arg Val Lys Asn Met Leu Tyr Pro Leu Ala Leu Ser Tyr He Cys 

210 215 220 

His Thr Phe Ser Ala Pro Tyr Ala Ser Leu Ala Ser Glu Leu Phe Gin 
225 230 235 240 

Arg Glu Val Ser Val Val Asp Leu Val Ser Tyr Ala Ser Val Trp Leu 

245 250 255 

Phe Arg Gly Asp Phe Val Met Asp Tyr Pro Arg Pro He Met Pro Asn 

260 265 270 

Met Val Phe He Gly Gly He Asn Cys Ala Asn Gly Lys Pro Leu Se^ 
275 280 285 

Gin 



<210> 7 

<211> 867 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . . (867) 

<400> 7 

atg gcc aca gga etc cag gtt ccc ctg ccg cag ctg gec aca gga ctg 4 8 

Met Ala Thr Gly Leu Gin Val Pro Leu Pro Gin Leu Ala Thr Gly Leu 

15 10 15 

ctg ctt etc etc agt gtc cag ccc tgg get gag agt ggg aag gtg ctg 96 
Leu Leu Leu Leu Ser Val Gin Pro Trp Ala Glu Ser Gly Lys Val Leu 
20 25 30 

gtg gtg ccc act gat ggc age cac tgg etc age atg egg gag gcc ttg 14 4 

Val Val Pro Thr Asp Gly Ser His Trp Leu Ser Met Arg Glu Ala Leu 
35 40 45 

egg gac etc cat gcg aga ggc cac cag gtg gtg gtc etc acc ctg gag 192 
Arg Asp Leu His Ala Arg Gly His Gin Val Val Val Leu Thr Leu Glu 
50 55 60 

gtg aat atg tac ate aaa gaa gag aac ttt ttc acc ctg aca acg tat 240 
Val Asn Met Tyr He Lys Glu Glu Asn Phe Phe Thr Leu Thr Thr Tyr 
65 70 75 80 

gcc att tea tgg acc cag gac gaa ttt gat cgc ctt ttg ctg ggt cac 288 
Ala lie Ser Trp Thr Gin Asp Glu Phe Asp Arg Leu Leu Leu Gly His 
85 90 95 
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act caa teg ttc ttt gaa aca gaa cat ctt ctg atg aaa ttt tct aga 
Thr Gin Ser Phe Phe Glu Thr Glu His Leu Leu Met Lys Phe Ser Arg 
100 105 no 



336 



aga atg gca att atg aac aat atg tct ttg ate ata cat agg tct tgt 384 
Arg Met Ala He Met Asn Asn Met Ser Leu He He His Arq Ser Cys 
115 120 125 

gtg gag eta ctg cat aat gag gec ctg ate agg cac ctg cat get act 4 32 

Val Glu Leu Leu His Asn Glu Ala Leu He Arg His Leu His Ala Thr 
130 135 140 

tec ttt gat gtg gtt eta aca gac ccc ttt cac etc tgc gcg gcg gtg 4 80 

Ser Phe Asp Val Val Leu Thr Asp Pro Phe His Leu Cys Ala Ala Val 
145 150 155 160 

ctg get aag tac ctg teg att cct get gtg ttt ttc ttg agg aac att 528 
Leu Ala Lys Tyr Leu Ser lie Pro Ala Val Phe Phe Leu Arg Asn lie 
165 170 175 

cca tgt gat tta gac ttt aag ggc aca cag tgt cca aac cct tec tec 576 
Pro Cys Asp Leu Asp Phe Lys Gly Thr Gin Cys Pro Asn Pro Ser Ser 
180 185 190 

tat att cct aga tta eta acg ace aat tea gac cac atg aca ttc ctg 624 
Tyr lie Pro Arg Leu Leu Thr Thr Asn Ser Asp His Met Thr Phe Leu 
195 200 205 

caa agg gtc aag aac atg etc tac cct ctg gee ctg tec tac ctt tgc 672 
Gin Arg Val Lys Asn Met Leu Tyr Pro Leu Ala Leu Ser Tyr Leu Cys 
210 215 220 

cat get gtt tct get cct tat gca age ctt gee tct gag ctt ttt cag 720 
His Ala Val Ser Ala Pro Tyr Ala Ser Leu Ala Ser Glu Leu Phe Gin 
225 230 235 240 

aga gag gtg tea gtg gtg gat ctt gtc age cat gca tct gtg tgg ctg 768 
Arg Glu Val Ser Val Val Asp Leu Val Ser His Ala Ser Val Trp Leu 
245 250 255 

ttc cga ggg gac ttt gtg atg gat tac ccc agg ccg ate atg ccc aac 816 
Phe Arg Gly Asp Phe Val Met Asp Tyr Pro Arg Pro lie Met Pro Asn 
260 265 270 

atg gtc ttc att ggg ggc ate aac tgt gee aac ggg aag cca eta tct 864 
Met Val Phe He Gly Gly He Asn Cys Ala Asn Gly Lys Pro Leu Ser 
275 280 285 

cag 867 
Gin 



<210> 8 
<211> 289 
<212> PRT 

<213> Homo sapiens 
<400> 8 

Met Ala Thr Gly Leu Gin Val Pro 

1 5 
Leu Leu Leu Leu Ser Val Gin Pro 
20 



Leu Pro Gin Leu Ala Thr Gly Leu 

10 15 

Trp Ala Glu Ser Gly Lys Val Leu 
25 30 



-8- 



WO 99/57322 PCT/US99/09702 

Val Val Pro Thr Asp Gly Ser His Trp Leu Ser Met Arg Glu Ala Leu 

35 40 45 

Arg Asp Leu His Ala Arg Gly His Gin Val Val Val Leu Thr Leu Glu 

50 55 60 

Val Asn Met Tyr He Lys Glu Glu Asn Phe Phe Thr Leu Thr Thr Tyr 
65 70 75 80 

Ala He Ser Trp Thr Gin Asp Glu Phe Asp Arg Leu Leu Leu Gly His 

85 90 95 

Thr Gin Ser Phe Phe Glu Thr Glu His Leu Leu Met Lys Phe Ser Arg 

100 105 HO 

Arg Met Ala He Met Asn Asn Met Ser Leu He He His Arg Ser Cys 

115 120 125 

Val Glu Leu Leu His Asn Glu Ala Leu He Arg His Leu His Ala Thr 

130 135 140 

Ser Phe Asp Val Val Leu Thr Asp Pro Phe His Leu Cys Ala Ala Val 
145 150 155 ~ 160 

Leu Ala Lys Tyr Leu Ser He Pro Ala Val Phe Phe Leu Arg Asn He 

165 170 175 

Pro Cys Asp Leu Asp Phe Lys Gly Thr Gin Cys Pro Asn Pro Ser Ser 

180 185 190 

Tyr He Pro Arg Leu Leu Thr Thr Asn Ser Asp His Met Thr Phe Leu 

195 200 205 

Gin Arg Val Lys Asn Met Leu Tyr Pro Leu Ala Leu Ser Tyr Leu Cys 

210 215 220 

His Ala Val Ser Ala Pro Tyr Ala Ser Leu Ala Ser Glu Leu Phe Gin 
225 230 235 240 

Arg Glu Val Ser Val Val Asp Leu Val Ser His Ala Ser Val Trp Leu 

245 250 255 

Phe Arg Gly Asp Phe Val Met Asp Tyr Pro Arg Pro He Met Pro Asn 

260 265 270 

Met Val Phe He Gly Gly He Asn Cys Ala Asn Gly Lys Pro Leu Ser 
275 280 285 

Gin 



<210> 9 

<211> 861 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . . (861) 

<400> 9 

atg gcc tgc etc ctt cgc tea ttt cag aga att tct gca ggg gtt ttc 4 8 

Met Ala Cys Leu Leu Arg Ser Phe Gin Arg He Ser Ala Gly Val Phe 
15 10 15 

ttc tta gca ctt tgg ggc atg gtt gta ggt gac aag ctg ctg gtg gtc 96 
Phe Leu Ala Leu Trp Gly Met Val Val Gly Asp Lys Leu Leu Val Val 
20 25 30 

cct cag gac gga age cac tgg ctt agt atg aag gat ata gtt gag gtt 14 4 

Pro Gin Asp Gly Ser His Trp Leu Ser Met Lys Asp He Val Glu Val 
35 40 45 

etc agt gac egg ggt cat gag att gta gtg gtg gtg cct gaa gtt aat 192 
Leu Ser Asp Arg Gly His Glu He Val Val Val Val Pro Glu Val Asn 
50 55 60 

ttg ctt ttg aaa gaa tec aaa tac tac aca aga aaa ate tat cca gtg 240 
Leu Leu Leu Lys Glu Ser Lys Tyr Tyr Thr Arg Lys lie Tyr Pro Val 
65 70 75 80 
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ccg tat gac caa gaa gag ctg aag aac cgt tac caa tea ttt gga aac 288 
Pro Tyr Asp Gin Glu Glu Leu Lys Asn Arg Tyr Gin Ser Phe Gly Asn 
85 90 95 

aat cac ttt get gag cga tea ttc eta act get cct cag aca gag tac 336 
Asn His Phe Ala Glu Arg Ser Phe Leu Thr Ala Pro Gin Thr Glu Tyr 
100 105 HO 

agg aat aac atg att gtt att ggc ctg tac ttc ate aac tgc cag age 384 
Arg Asn Asn Met lie Val He Gly Leu Tyr Phe He Asn Cys Gin Ser 
115 120 125 

etc ctg cag gac agg gac acc ctg aac ttc ttt aag gag age aag ttt 4 32 

Leu Leu Gin Asp Arg Asp Thr Leu Asn Phe Phe Lys Glu Ser Lys Phe 
130 135 140 

gat get ctt ttc aca gac cca gee tta ccc tgt ggg gtg ate ctg get 4 80 

Asp Ala Leu Phe Thr Asp Pro Ala Leu Pro Cys Gly Val He Leu Ala 
145 150 155 160 

gag tat ttg ggc eta cca tct gtg tac etc ttc agg ggt ttt ccg tgt 528 
Glu Tyr Leu Gly Leu Pro Ser Val Tyr Leu Phe Arg Gly Phe Pro Cys 
165 170 175 

tec ctg gag cat aca ttc age aga age cca gac cct gtg tec tac att 576 
Ser Leu Glu His Thr Phe Ser Arg Ser Pro Asp Pro Val Ser Tyr He 
180 185 190 

ccc agg tgc tac aca aag ttt tea gac cac atg act ttt tec caa cga 624 
Pro Arg Cys Tyr Thr Lys Phe Ser Asp His Met Thr Phe Ser Gin Arg 
195 200 205 

gtg gee aac ttc ctt gtt aat ttg ttg gag ccc tat eta ttt tat tgt 672 
Val Ala Asn Phe Leu Val Asn Leu Leu Glu Pro Tyr Leu Phe Tyr Cys 
210 215 220 

ctg ttt tea aag tat gaa gaa etc gca tea get gtc etc aag aga gat 720 
Leu Phe Ser Lys Tyr Glu Glu Leu Ala Ser Ala Val Leu Lys Arg Asp 
22 5 230 235 240 

•gtg gat ata ate acc tta tat cag aag gtc tct gtt tgg ctg tta aga 768 
Val Asp He He Thr Leu Tyr Gin Lys Val Ser Val Trp Leu Leu Arg 
245 250 255 

tat gac ttt gtg ctt gaa tat cct agg ccg gtc atg ccc aac atg gtc 816 
Tyr Asp Phe Val Leu Glu Tyr Pro Arg Pro Val Met Pro Asn Met Val 
260 265 270 

ttc att gga ggt ate aac tgt aag aag agg aaa gac ttg tct cag 861 
Phe He Gly Gly He Asn Cys Lys Lys Arg Lys Asp Leu Ser Gin 
275 280 285 



<210> 10 
<211> 287 
<212> PRT 

<213> Homo sapiens 

<400> 10 
Met Ala Cys Leu Leu Arg Ser Phe 

1 5 
Phe Leu Ala Leu Trp Gly Met Val 
20 



Gin Arg He Ser Ala Gly Val Phe 

10 15 

Val Gly Asp Lys Leu Leu Val Val 
25 30 
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Pro Gin Asp Gly Ser His Trp Leu Ser Met Lys Asp He Val Glu Val 

35 40 . " 45 

Leu Ser Asp Arg Gly His Glu He Val Val Val Val Pro Glu Val Asn 

50 55 60 

Leu Leu Leu Lys Glu Ser Lys Tyr Tyr Thr Arg Lys He Tyr Pro Val 
65 70 75 80 

Pro Tyr Asp Gin Glu Glu Leu Lys Asn Arg Tyr Gin Ser Phe Gly Asn 

85 90 95 

Asn His Phe Ala Glu Arg Ser Phe Leu Thr Ala Pro Gin Thr Glu Tyr 

100 105 110 

Arg Asn Asn Met He Val lie Gly Leu Tyr Phe He Asn Cys Gin Ser 

115 120 125 

Leu Leu Gin Asp Arg Asp Thr Leu Asn Phe Phe Lys Glu Ser Lys Phe 

130 135 140 

Asp Ala Leu Phe Thr Asp Pro Ala Leu Pro Cys Gly Val He Leu Ala 
145 150 155 160 

Glu Tyr Leu Gly Leu Pro Ser Val Tyr Leu Phe Arg Gly Phe Pro Cys 

165 170 J 175 

Ser Leu Glu His Thr Phe Ser Arg Ser Pro Asp Pro Val Ser Tyr lie 

180 185 ' 190 

Pro Arg Cys Tyr Thr Lys Phe Ser Asp His Met Thr Phe Ser Gin Arg 

195 200 205 

Val Ala Asn Phe Leu Val Asn Leu Leu Glu Pro Tyr Leu Phe Tyr Cys 

210 215 220 

Leu Phe Ser Lys Tyr Glu Glu Leu Ala Ser Ala Val Leu Lys Arg Asp 
225 230 235 240 

Val Asp He lie Thr Leu Tyr Gin Lys Val Ser Val Trp Leu Leu Arg 

24 5 250 255 

Tyr Asp Phe Val Leu Glu Tyr Pro Arg Pro Val Met Pro Asn Met Val 

260 265 270 

Phe He Gly Gly He Asn Cys Lys Lys Arg Lys Asp Leu Ser Gin 
275 280 285 

<210> 11 

<211> 951 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . . (951) 

<400> 11 

atg get cgt gca ggg tgg act ggc etc ctt ccc eta tat gtg tgt eta 4 8 

Met Ala Arg Ala Gly Trp Thr Gly Leu Leu Pro Leu Tyr Val Cys Leu 

15 10 15 

ctg ctg acc tgt get ttg cca agg tea ggg aag ctg ctg gta gtg ccc 96 
Leu Leu Thr Cys Ala Leu Pro Arg Ser Gly Lys Leu Leu Val Val Pro 
20 25 30 

atg gat ggg age cac tgg ttc acc atg cag teg gtg gtg gag aaa etc 144 
Met Asp Gly Ser His Trp Phe Thr Met Gin Ser Val Val Glu Lys Leu 
35 4 0 4 5 

ate etc agg ggg cat gag gtg gtc gta gtc atg cca gag gtg agt tgg 192 
He Leu Arg Gly His Glu Val Val Val Val Met Pro Glu Val Ser Trp 
50 55 60 

caa ctg gga aga tea ctg aat tgc aca gtg aag act tac tea acc tea 240 
Gin Leu Gly Arg Ser Leu Asn Cys Thr Val Lys Thr Tyr Ser Thr Ser 
65 70 75 80 
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tac act ctg gag gat cag gac egg gag ttc atg gtt ttt gec gat get 288 

Tyr Thr Leu Glu Asp Gin Asp Arg Glu Phe Met Val Phe Ala Asp Ala 

85 90 95 



cgc tgg acg gca cca ttg cga agt gca ttt tct eta tta aca agt tea 336 
Arg Trp Thr Ala Pro Leu Arg Ser Ala Phe Ser Leu Leu Thr Ser Ser 
100 105 HO 



tec aat ggt att ttt gac tta ttt ttt tea aat tgc agg agt ttg ttt 384 
Ser Asn Gly lie Phe Asp Leu Phe Phe Ser Asn Cys Arg Ser Leu Phe 
115 120 125 

aat gac cga aaa tta gta gaa tac tta aag gag agt tgt ttt gat gca 432 
Asn Asp Arg Lys Leu Val Glu Tyr Leu Lys Glu Ser Cys Phe Asp Ala 
130 135 140 

gtg ttt etc gat cct ttt gat cgc tgt ggc tta att gtt gee aaa tat 480 
Val Phe Leu Asp Pro Phe Asp Arg Cys Gly Leu He Val Ala Lys Tyr 
145 150 155 160 

ttc tec etc ccc tct gtg gtc ttc gee agg gga ata ttt tgc cac tat 528 
Phe Ser Leu Pro Ser Val Val Phe Ala Arg Gly He Phe Cys His Tyr 
165 170 175 



ctt gaa gaa ggt gca cag tgc cct get cct ctt tec tat gtc ccc aga 576 
Leu Glu Glu Gly Ala Gin Cys Pro Ala Pro Leu Ser Tyr Val Pro Arg 
180 185 190 

ctt etc tta ggg ttc tea gac gec atg act ttc aag gag aga gta tgg 624 
Leu Leu Leu Gly Phe Ser Asp Ala Met Thr Phe Lys Glu Arg Val Trp 
195 200 205 

aac cac ate atg cac ttg gag gaa cat tta ttt tgc ccc tat ttt ttc 672 
Asn His He Met His Leu Glu Glu His Leu Phe Cys Pro Tyr Phe Phe 
210 215 220 

aaa aat gtc tta gaa ata gec tct gaa att etc caa acc cct gtc acg 720 
Lys Asn Val Leu Glu He Ala Ser Glu lie Leu Gin Thr Pro Val Thr 
225 230 235 240 

gca tat gat etc tac age cac aca tea att tgg ttg ttg cga act gac 768 
Ala Tyr Asp Leu Tyr Ser His Thr Ser He Trp Leu Leu Arg Thr Asp 
245 250 255 

ttt gtt ttg gag tat ccc aaa ccc gtg atg ccc aat atg ate ttc att 816 
Phe Val Leu Glu Tyr Pro Lys Pro Val Met Pro Asn Met lie Phe He 
260 265 270 



ggt ggt ate aac tgt cat cag gga aag cca gtg cct atg gta agt tat 864 

Gly Gly He Asn Cys His Gin Gly Lys Pro Val Pro Met Val Ser Tyr 
275 280 285 

etc ccc ttt age aca tta aga ata ate tgg ctt tgg aaa tta aaa gat 912 

Leu Pro Phe Ser Thr Leu Arg He lie Trp Leu Trp Lys Leu Lys Asp 
290 295 * 300 



ttc tta cag aat cat aat tta tea ttt aca ttt gtc cca 951 
Phe Leu Gin Asn His Asn Leu Ser Phe Thr Phe Val Pro 
305 310 315 



<210> 12 
<211> 317 
<212> PRT 
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<213> Homo sapiens 

<400> 12 

Met Ala Arg Ala Gly Trp Thr Gly Leu Leu Pro Leu Tyr Val Cys Leu 

15 10 15 

Leu Leu Thr Cys Ala Leu Pro Arg Ser Gly Lys Leu Leu Val Val Pro 

20 25 ~ J 30 

Met Asp Gly Ser His Trp Phe Thr Met Gin Ser Val Val Glu Lys Leu 

35 40 45 

He Leu Arg Gly His Glu Val Val Val Val Met Pro Glu Val Ser Trp 

50 55 60 

Gin Leu Gly Arg Ser Leu Asn Cys Thr Val Lys Thr Tyr Ser Thr Ser 
65 "70 75 " 80 

Tyr Thr Leu Glu Asp Gin Asp Arg Glu Phe Met Val Phe Ala Asp Ala 

85 90 95 

Arg Trp Thr Ala Pro Leu Arg Ser Ala Phe Ser Leu Leu Thr Ser Ser 

100 105 HO 

Ser Asn Gly He Phe Asp Leu Phe Phe Ser Asn Cys Arg Ser Leu Phe 

115 120 125 

Asn Asp Arg Lys Leu Val Glu Tyr Leu Lys Glu Ser Cys Phe Asp Ala 

130 135 ^ 140 

Val Phe Leu Asp Pro Phe Asp Arg Cys Gly Leu He Val Ala Lys Tyr 
I 45 150 155 160 

Phe Ser Leu Pro Ser Val Val Phe Ala Arg Gly He Phe Cys His Tyr 

165 170 17 5 

Leu Glu Glu Gly Ala Gin Cys Pro Ala Pro Leu Ser Tyr Val Pro Arg 

180 185 190 

Leu Leu Leu Gly Phe Ser Asp Ala Met Thr Phe Lys Glu Arg Val Trp 

195 200 205 

Asn His He Met His Leu Glu Glu His Leu Phe Cys Pro Tyr Phe Phe 

210 215 220 

Lys Asn Val Leu Glu He Ala Ser Glu He Leu Gin Thr Pro Val Thr 
225 230 235 240 

Ala Tyr Asp Leu Tyr Ser His Thr Ser He Trp Leu Leu Arg Thr Asp 

245 250 255 

Phe Val Leu Glu Tyr Pro Lys Pro Val Met Pro Asn Met He Phe He 

260 265 270 

Gly Gly He Asn Cys His Gin Gly Lys Pro Val Pro Met Val Ser Tyr 

275 280 285 

Leu Pro Phe Ser Thr Leu Arg He He Trp Leu Trp Lys Leu Lys Asp 

2 90 295 300 

Phe Leu Gin Asn His Asn Leu Ser Phe Thr Phe Val Pro 
305 310 315 

<210> 13 

<211> 930 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . . (930) 

<400> 13 

atg get cgc aca ggg tgg acc age ccc att ccc eta tgt gtt tct ctg 48 

Met Ala Arg Thr Gly Trp Thr Ser Pro He Pro Leu Cys Val Ser Leu 

15 10 15 

ctg ctg acc tgt ggc ttt get gag gca ggg aag ctg ctg gta gtg ccc 96 
Leu Leu Thr Cys Gly Phe Ala Glu Ala Gly Lys Leu Leu Val Val Pro 
20 25 30 
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atg gat ggg agt cac tgg ttc acc atg cag teg gtg gtg gag aaa ctt 144 
Met Asp Gly Ser His Trp Phe Thr. Met Gin Ser Val Val Glu Lys Leu 
35 40 45 

ate etc agg ggg cat gag gtg gtt gta gtc atg cca gag gtg agt tgg 192 
He Leu Arg Gly His Glu Val Val Val Val Met Pro Glu Val Ser Trp 
50 55 60 

caa ctg gga aaa tea ctg aat tgc aca gtg aag act tac tea acc tea 240 
Gin Leu Gly Lys Ser Leu Asn Cys Thr Val Lys Thr Tyr Ser Thr Ser 
65 70 75 80 

tac act ctg gag gat ctg gac egg gaa ttc atg gat ttc gee gat get 288 
Tyr Thr Leu Glu Asp Leu Asp Arg Glu Phe Met Asp Phe Ala Asp Ala 
85 90 95 

caa tgg aaa gca caa gta cga agt ttg ttt tct eta. ttt ctg agt tea 336 
Gin Trp Lys Ala Gin Val Arg Ser. Leu Phe Ser Leu Phe Leu Ser Ser 
100 105 no 

tec aat ggt ttt ttt aac tta ttt ttt teg cat tgc agg agt ttg ttt 384 
Ser Asn Gly Phe Phe Asn Leu Phe Phe Ser His Cys Arg Ser Leu Phe 
115 120 125 

aat gac cga aaa tta gta gaa tac tta aag gag agt tct ttt gat gcg 432 
Asn Asp Arg Lys Leu Val Glu Tyr Leu Lys Glu Ser Ser Phe Asp Ala 
130 135 140 

gtg ttt ctt gat cct ttt gat gee tgt gcg tta att gtt gee aaa tat 480 
Val Phe Leu Asp Pro Phe Asp Ala Cys Ala Leu He Val Ala Lys Tyr 
145 150 155 160 

ttc tec etc ccc tct gtg gtc ttc gee agg gga ata ggt tgc cac tat 528 
Phe Ser Leu Pro Ser Val Val Phe Ala Arg Gly He Gly Cys His Tyr 
165 170 175 

ctt gaa gaa ggt gca cag tgc cct get cct ctt tec tat gtc ccc aga 576 
Leu Glu Glu Gly Ala Gin Cys Pro Ala Pro Leu Ser Tyr Val Pro Arg 
180 185 190 

att etc tta ggg ttc tea gat gee atg act ttc aag gag aga gta egg 624 
He Leu Leu Gly Phe Ser Asp Ala Met Thr Phe Lys Glu Arg Val Arg 
195 200 205 

aac cac ate atg cac ttg gag gaa cat tta ttt tgc cag tat ttt tec 672 
Asn His He Met His Leu Glu Glu His Leu Phe Cys Gin Tyr Phe Ser 
210 215 220 

aaa aat gee eta gaa ata gec tct gaa att etc caa aca cct gtc aca 720 
Lys Asn Ala Leu Glu He Ala Ser Glu He Leu Gin Thr Pro Val Thr 
225 230 235 240 

gca tat gat etc tac age cac aca tea att tgg ttg ttg cga aca gac 768 
Ala Tyr Asp Leu Tyr Ser His Thr Ser He Trp Leu Leu Arg Thr Asp 
245 250 255 

ttt gtt ttg gac tat ccc aaa ccc gtg atg ccc aat atg ate ttc att 816 
Phe Val Leu Asp Tyr Pro Lys Pro Val Met Pro Asn Met He Phe He 
260 265 270 

ggt ggt ate aac tgc cat cag gga aag cca ttg cct atg gta agt cac 864 
Gly Gly He Asn Cys His Gin Gly Lys Pro Leu Pro Met Val Ser His 
275 280 285 
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etc tec ttt age aca tta gga ata ate ttg get ttg gaa att aaa aaa 912 
Leu Ser Phe Ser Thr Leu Gly lie lie Leu Ala Leu Glu He Lys Lys 
290 295 300 

aga ttc ctt act gaa ttg 930 
Arg Phe Leu Thr Glu Leu 
305 310 



<210> 14 

<211> 310 

<212> PRT 

<213> Homo sapiens 

<400> 14 

Met Ala Arg Thr Gly Trp Thr Ser Pro He Pro Leu Cys Val Ser Leu 

1 5 10 15 

Leu Leu Thr Cys Gly Phe Ala Glu Ala Gly Lys Leu Leu Val Val Pro 

20 25 30 

Met Asp Gly Ser His Trp Phe Thr Met Gin Ser Val Val Glu Lys Leu 

35 4 0 45 

He Leu Arg Gly His Glu Val Val Val Val Met Pro Glu Val Ser Trp 

50 55 60 

Gin Leu Gly Lys Ser Leu Asn Cys Thr Val Lys Thr Tyr Ser Thr Ser 
65 70 75 " 80 

Tyr Thr Leu Glu Asp Leu Asp Arg Glu Phe Met Asp Phe Ala Asp Ala 

85 90 95 

Gin Trp Lys Ala Gin Val Arg Ser Leu Phe Ser Leu Phe Leu Ser Ser 

100 105 HO 

Ser Asn Gly Phe Phe Asn Leu Phe Phe Ser His Cys Arg Ser Leu Phe 

115 120 125 

Asn Asp Arg Lys Leu Val Glu Tyr Leu Lys Glu Ser Ser Phe Asp Ala 

130 135 * 140 

Val Phe Leu Asp Pro Phe Asp Ala Cys Ala Leu He Val Ala Lys Tyr 
145 150 155 160 

Phe Ser Leu Pro Ser Val Val Phe Ala Arg Gly He Gly Cys His Tyr 

165 170 175 

Leu Glu Glu Gly Ala Gin Cys Pro Ala Pro Leu Ser Tyr Val Pro Arg 

180 185 190 

He Leu Leu Gly Phe Ser Asp Ala Met Thr Phe Lys Glu Arg Val Arg 

195 200 205 

Asn His He Met His Leu Glu Glu His Leu Phe Cys Gin Tyr Phe Ser 

210 215 220 

Lys Asn Ala Leu Glu He Ala Ser Glu He Leu Gin Thr Pro Val Thr 
225 230 235 240 

Ala Tyr Asp Leu Tyr Ser His Thr Ser He Trp Leu Leu Arg Thr Asp 

245 250 255 

Phe Val Leu Asp Tyr Pro Lys Pro Val Met Pro Asn Met He Phe He 

260 265 270 

Gly Gly He Asn Cys His Gin Gly Lys Pro Leu Pro Met Val Ser His 

275 280 285 

Leu Ser Phe Ser Thr Leu Gly lie He Leu Ala Leu Glu He Lys Lys 

290 295 300 

Arg Phe Leu Thr Glu Leu 
305 310 

<210> 15 

<211> 759 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . . (759) 
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<400> 15 

atg gat ggg agt cac tgg ttc acc atg cag teg gtg gtg gag aaa ctt 4 8 

Met Asp Gly Ser His Trp Phe Thr Met Gin Ser Val Val Glu Lys Leu' 
15 10 15 

ate etc agg ggg cat gag gtg gtt gta gtc atg cca gag gtg agt tgg 96 
He Leu Arg Gly His Glu Val Val Val Val Met Pro Glu Val Ser Trp 
20 25 30 

caa ctg gaa aga tea ctg aat tgc aca gtg aag act tac tea acc teg 14 4 

Gin Leu Glu Arg Ser Leu Asn Cys Thr Val Lys Thr Tyr Ser Thr Ser 
35 40 45 

tac act ctg gaa gat cag aac egg gaa ttc atg gtt ttc gec cat get 192 
Tyr Thr Leu Glu Asp Gin Asn Arg Glu Phe Met Val Phe Ala His Ala 
50 55 60 

caa tgg aaa gca cag gca caa agt ata ttt tct eta tta atg agt tea 240 
Gin Trp Lys Ala Gin Ala Gin Ser He Phe Ser Leu Leu Met Ser Ser 
65 ^ 70 75 80 

tec agt ggt ttt ctt gac tta ttt ttt teg cat tgc agg agt ttg ttt 288 
Ser Ser Gly Phe Leu Asp Leu Phe Phe Ser His Cys Arg Ser Leu Phe 
85 90 95 

aat gac cga aaa tta gta gaa tac tta aag gag agt tct ttt gat gca 336 
Asn Asp Arg Lys Leu Val Glu Tyr Leu Lys Glu Ser Ser Phe Asp Ala 
100 105 110 

gtg ttt ctg gat cct ttt gat acc tgt ggc tta att gtt get aaa tat 384 
Val Phe Leu Asp Pro Phe Asp Thr Cys Gly Leu He Val Ala Lys Tyr 
115 120 125 

ttc tec etc ccc tct gtg gtc ttc acc agg gga ata ttt tgc cac cat 432 
Phe Ser Leu Pro Ser Val Val Phe Thr Arg Gly He Phe Cys His His 
130 135 140 

ctt gaa gaa ggt gca cag tgc cct get cct ctt tec tat gtc ccc aat 4 80 

Leu Glu Glu Gly Ala Gin Cys Pro Ala Pro Leu Ser Tyr Val Pro Asn 
145 150 155 160 

gat etc tta ggg ttc tea gat gee atg act ttc aag gag aga gta tgg 528 
Asp Leu Leu Gly Phe Ser Asp Ala Met Thr Phe Lys Glu Arg Val Trp 
165 170 175 

aac cac ate gtg cac ttg gag gac cat tta ttt tgc cag tat ctt ttt 576 
Asn His lie Val His Leu Glu Asp His Leu Phe Cys Gin Tyr Leu Phe 
180 185 190 

aga aat gee eta gaa ata gee tct gaa att etc caa acc cct gtc acg 624 
Arg Asn Ala Leu Glu He Ala Ser Glu He Leu Gin Thr Pro Val Thr 
195 200 205 

gca tat gat etc tac agt cac aca tea att tgg ttg ttg cga acg gac 672 
Ala Tyr Asp Leu Tyr Ser His Thr Ser He Trp Leu Leu Arg Thr Asp 
210 215 220 

ttt gtt ttg gac tat ccc aaa ccc gtg atg ccc aac atg ate ttc att 720 
Phe Val Leu Asp Tyr Pro Lys Pro Val Met Pro Asn Met He Phe He 
225 230 235 240 

ggt ggt ate aac tgt cat cag gga aag cca ttg cct atg 759 
Gly Gly He Asn Cys His Gin Gly Lys Pro Leu Pro Met 
245 250 
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<210> 16 
<211> 253 
<212> PRT 

<213> Homo sapiens 
<400> 16 

Met Asp Gly Ser His Trp Phe Thr Met Gin Ser Val Val Glu Lys Leu 

1 5 10 15 

He Leu Arg Gly His Glu Val Val Val Val Met Pro Glu Val Ser Trp 

20 25 30 

Gin Leu Glu Arg Ser Leu Asn Cys Thr Val Lys Thr Tyr Ser Thr Ser 

35 40 45 

Tyr Thr Leu Glu Asp Gin Asn Arg Glu Phe Met Val Phe Ala His Ala 

50 55 60 

Gin Trp Lys Ala Gin Ala Gin Ser He Phe Ser Leu Leu Met Ser Ser 
65 " 70 75 80 

Ser Ser Gly Phe Leu Asp Leu Phe Phe Ser His Cys Arg Ser Leu Phe 

85 90 95 

Asn Asp Arg Lys Leu Val Glu Tyr Leu Lys Glu Ser Ser Phe Asp Ala 

100 105 110 

Val Phe Leu Asp Pro Phe Asp Thr Cys Gly Leu He Val Ala Lys Tyr 

115 120 125 

Phe Ser Leu Pro Ser Val Val Phe Thr Arg Gly He Phe Cys His His 

130 135 140 

Leu Glu Glu Gly Ala Gin Cys Pro Ala Pro Leu Ser Tyr Val Pro Asn 
145 150 155 160 

Asp Leu Leu Gly Phe Ser Asp Ala Met Thr Phe Lys Glu Arg Val Trp 

165 170 175 

Asn His He Val His Leu Glu Asp His Leu Phe Cys Gin Tyr Leu Phe 

180 185 190 

Arg Asn Ala Leu Glu He Ala Ser Glu He Leu Gin Thr Pro Val Thr 

195 200 205 

Ala Tyr Asp Leu Tyr Ser His Thr Ser He Trp Leu Leu Arg Thr Asp 

210 215 220 

Phe Val Leu Asp Tyr Pro Lys Pro Val Met Pro Asn Met He Phe He 
225 230 ^ 235 240 

Gly Gly He Asn Cys His Gin Gly Lys Pro Leu Pro Met 
245 250 

<210> 17 

<211> 735 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . . (735) 

<400> 17 

gaa ttt gaa gcc tac att aat get tct gga gaa cat gga att gtg gtt 48 
Glu Phe Glu Ala Tyr He Asn Ala Ser Gly Glu His Gly He Val Val 
15 10 15 

ttc tct ttg gga tea atg gtc tea gaa att cca gag aag aaa get atg 96 
Phe Ser Leu Gly Ser Met Val Ser Glu He Pro Glu Lys Lys Ala Met 
20 25 30 

gca att get gat get ttg ggc aaa ate cct cag aca gtc ctg tgg egg 144 
Ala He Ala Asp Ala Leu Gly Lys He Pro Gin Thr Val Leu Trp Arg 
35 40 45 
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tac act gga acc cga cca teg aat ctt gcg aac aac acg ata ctt gtt 192 
Tyr Thr Gly Thr Arg Pro Ser Asn. Leu Ala Asn Asn Thr lie Leu Val 
50 55 60 

aag tgg eta ccc caa aac gat ctg ctt ggt cac ccg atg acc cgt gec 240 
Lys Trp Leu Pro Gin Asn Asp Leu Leu Gly His Pro Met Thr Arg Ala 
65 70 75 80 

ttt ate acc cat get ggt tec cat ggt gtt tat gaa age ata tgc aat 288 
Phe He Thr His Ala Gly Ser His Gly Val Tyr Glu Ser He Cys Asn 
85 90 95 

ggc gtt ccc atg gtg atg atg ccc ttg ttt ggt gat cag atg gac aat 336 
. Gly Val Pro Met Val Met Met Pro Leu Phe Gly Asp Gin Met Asp Asn 
100 105 110 

gca aag cgc atg gag act aag gga get gga gtg acc ctg aat gtt ctg 384 
Ala Lys Arg Met Glu Thr Lys Gly. Ala Gly Val Thr Leu Asn Val Leu 
115 120 125 

gaa atg act tct gaa gat tta gaa aat get eta aaa gca gtc ate aat 4 32 

Glu Met Thr Ser Glu Asp Leu Glu Asn Ala Leu Lys Ala Val He Asn 
130 135 140 

gac aaa agt tac aag gag aac ate atg cgc etc tec age ctt cac aag 480 
Asp Lys Ser Tyr Lys Glu Asn He Met Arg Leu Ser Ser Leu His Lys 
145 150 155 160 

gac cgc ccg gtg gag ccg ctg gac ctg gee gtg ttc tgg gtg gag ttt 528 
Asp Arg Pro Val Glu Pro Leu Asp Leu Ala Val Phe Trp Val Glu Phe 
165 170 175 

gtg atg agg cac aag ggc gcg cca cac ctg cgc ccc gca gee cac gac 576 
Val Met Arg His Lys Gly Ala Pro His Leu Arg Pro Ala Ala His Asp 
180 185 190 

etc acc tgg tac cag tac cat tec ttg gac gtg att ggt ttc etc ttg 624 
Leu Thr Trp Tyr Gin Tyr His Ser Leu Asp Val He Gly Phe Leu Leu 
195 200 " 205 

gee gtc gtg ctg aca gtg gee ttc ate acc ttt aaa tgt tgt get tat 672 
Ala Val Val- Leu Thr Val Ala Phe He Thr Phe Lys Cys Cys Ala Tyr 
210 215 220 

ggc tac egg aaa tgc ttg ggg aaa aaa ggg cga gtt aag aaa gec cac 720 
Gly Tyr Arg Lys Cys Leu Gly Lys Lys Gly Arg Val Lys Lys Ala His 
225 230 235 ~ 240 

aaa tec aag acc cat 735 
Lys Ser Lys Thr His 
245 



<210> 18 
<211> 245 
<212> PRT 

<213> Homo sapiens 
<400> 18 

Glu Phe Glu Ala Tyr He Asn Ala Ser Gly Glu His Gly He Val Val 

15 10 15 

Phe Ser Leu Gly Ser Met Val Ser Glu He Pro Glu Lys Lys Ala Met 
20 25 " 30 



-18- 



WO 99/57322 



Ala 


He 


Ala Asp Ala 


Leu Gly Lys 


He 






35 








40 • 




Tyr 


Thr 


Gly Thr Arg 


Pro Ser 


Asn 


Leu 


50 








55 






Lys 


Trp 


Leu 


Pro 


Gin 


Asn Asp 


Leu 


Leu 


65 










70 






Phe 


He 


Thr 


His 


Ala 


Gly Ser 


His 


Gly 










85 








Gly 


Val 


Pro 


Met 


Val 


Met Met 


Pro 


Leu 






100 








105 


Ala 


Lys 


Arg 


Met 


Glu 


Thr Lys 


Gly Ala 




115 








120 




Glu 


Met 


Thr 


Ser 


Glu 


Asp Leu 


Glu 


Asn 




130 








135 






Asp 


Lys 


Ser 


Tyr 


Lys 


Glu Asn 


He 


Met 


145 










150 






Asp 


Arg 


Pro 


Val 


Glu 


Pro Leu 


Asp 


Leu 








165 








Val 


Met 


Arg 


His 


Lys 


Gly Ala 


Pro 


His 








180 








185 


Leu 


Thr 


Trp 


Tyr 


Gin 


Tyr His 


Ser 


Leu 






195 








200 




Ala 


Val 


Val 


Leu 


Thr 


Val Ala 


Phe 


He 




210 








215 






Gly 


Tyr 


Arg 


Lys 


Cys 


Leu Gly Lys 


Lys 


225 










230 






Lys 


Ser 


Lys 


Thr 


His 














245 









<210> 19 . 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 19 
tggtgtatcg attggtttt 

<210> 20 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 20 
catatatctg gggctagtta ate 

<210> 21 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 21 
acaaggtaat taagatgaag aaagca 

<210> 22 
<211> 20 
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Pro Gin 


Thr 


Val 
45 


Leu 


Trp 


Arg 


Ala Asn 


Asn 
60 


Thr 


He 


Leu 


Val 


Gly His 


Pro 


Met 


Thr 


Arg 


Ala 


75 










o r> 

y u 


Val Tyr 


Glu 


Ser 


He 


Cys 


Asn 


90 








95 




Phe Gly 


Asp 


Gin 


Met 
110 


Asp 


Asn 


Gly Val 


Thr 


Leu 


Asn 


Val 


Leu 




125 








Ala Leu 


Lys 
140 


Ala 


Val 


He 


Asn 


Arg Leu 


Ser 


Ser 


Leu 


His 


Lys 


155 










160 


Ala Val 


Phe 


Trp 


Val 


Glu 


Phe 


170 








175 




Leu Arg 


Pro 


Ala 


Ala 
i on 


His 


Asp 


Asp Val 


He 


Gly 
205 


Phe 


Leu 


Leu 


Thr Phe 


Lys 
220 


Cys 


Cys 


Ala 


Tyr 


Gly Arg 


Val 


Lys 


Lys 


Ala 


His 



235 240 



19 



23 
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<212> DNA 

<213> Artificial Sequence' 
<220> 

<223> Primer 

<400> 22 
acctgagata gtggcttcct 

<210> 23 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 23 
tttgtcttcc aattacatgc 

<210> 24 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 24 
agtagatatg gaagcacttg taag 

<210> 25 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 25 
tctcagtgac aaggtaatta agac 

<210> 26 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 26 
cattgattgg ataaaggca 

<210> 27 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 27 
aatttgggtt cttacatatc aa 
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<210> 28 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 28 
gagtgaggga ggacagag 

<210> 29 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 29 
ataagtacac gccttctttt g 

<210> 30 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 30 
gctgctttat acaatttgct ac 

<210> 31 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 31 
cgcctacgta tcatagcagt ta 

<210> 32 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 32 
ggaaagaaat ttgaaatgca ac 

<210> 33 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 33 
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18 



21 



22 



22 



-21- 



PCT/US99/09702 

WO 99/57322 

tctttccgcc tactgtatca 

<210> 34 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 34 
ttcaagaagg gcagttttat 

<210> 35 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 35 
ctctggcagg agcaaag 

<210> 36 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 36 
atacacacct gggatagtgg 

<210> 37 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 37 
ggtaattaag atgaagaaag ca 

<210> 38 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 38 
ctgagatagt ggcttcctg 

<210> 39 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
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<400> 39 
gtggctcaat gacaagg 

<210> 40 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 40 
atatggaagc acttgtaagt aaa 

<210> 41 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 41 
ttaagacgaa ggaaacaatt ct 

<210> 42 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 42 
acctgagata gtggcttcc 

<210> 43 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 43 
atcaaagggt aaaattcaga 

<210> 44 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 44 
ggcagtccaa aagaaata 

<210> 45 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
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<223> Primer 

<400> 45 
ttttgagggc aggttcta 

<210> 46 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 46 
aatgggacaa atgtaaatga ta 

<210> 47 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 47 
ttctctcatg gctcgca 

<210> 48 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 48 
atgtcaaatc acaattcagt aagg 

<210> 49 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 49 
ccgcctactg tatcatagca 

<210> 50 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 50 
caacgaaatg tcaaatcaca g 

<210> 51 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
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18 



22 



17 



24 



20 
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<220> 

<223> Primer 

<400> 51 
ctctggcagg agcaaag 

<210> 52 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 52 
acagtgggca gagacag 

<210> 53 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 53 
gtggtttatt ccccgtat 

<210> 54 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 54 
atacacacct gggatagtgg 

<210> 55 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 55 
ggtaattaag atgaagaaag ca 

<210> 56 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 56 
gaaatggcat aggttgtc 

<210> 57 
<211> 17 
<212> DNA 
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<213> Artificial Sequence 

<220> 

<223> Primer 

<400> 57 
ggccacactc aactgta 

<210> 58 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 58 
ctcaaaaaaa acacagtagg 

<210> 59 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 59 
actttttctg ccccttat 

<210> 60 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 60 
atatggaagc acttgtaagt aaa 

<210> 61 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 61 
ttaagacgaa ggaaacaatt ct 

<210> 62 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 62 
aatggcatac gttgtca 
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17 



20 



18 



23 



22 



<210> 63 



WO 99/57322 

<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 63 
agaatggcaa ttatgaaca 

<210> 64 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 64 
tgtgtgccct taaagtct 

<210> 65 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 65 
agaatggcaa ttatgaaca 

<210> 66 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 66 
acctgagata gtggcttcc 

<210> 67 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 67 
ctctggctct gtcctac 

<210> 68 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 68 
acctgagata gtggcttcc 
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19 



18 



19 



19 



17 
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<210> 69 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Primer 

<400> 69 
atcaaagggt aaaattcaga 

<210> 70 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 70 
cagcagcttg tcacctac 

<210> 71 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 71 
aatttgcttt tgaaagaatc 

<210> 72 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 72 
ggtaggccca aatactca 

<210> 73 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<40O> 73 
aatttgcttt tgaaagaatc 

<210> 74 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
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20 



18 



20 



18 
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<400> 74 
ggcagtccaa aagaaata 
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<210> 75 

<211> 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 75 
ttttgagggc aggttcta 

<210> 76 
<211> 18 
<212> DNA 

<213> Artificial Sequence. 
<220> 

<223> Primer 

<400> 76 
cacctctggc atgactac 

<210> 77 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 77 
ttgcaggagt ttgtttaat 

<210> 78 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 78 
aatgggacaa atgtaaatga ta 

<210> 79 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 79 
cattgcagga gtttgttta 

<210> 80 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> Primer 
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<400> 80 
catctgagaa ccctaagaga 

<210> 81 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 81 
agaaatagcc tctgaaattc 

<210> 82 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 82 
atgtcaaatc acaattcagt aagg 

<210> 83 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 83 
ccgcctactg tatcatagca 

<210> 84 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 84 
gagtgtacga ggttgagtaa g 

<210> 85 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 85 
attttgccag tatcttttta g 

<210> 86 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Primer 
<400> 86 

caacgaaatg tcaaatcaca g 21 

<210> 87 

<211> 27 

<212> DNA 

<213> Homo sapiens 

<400> 87 

catcagagac agagcatttt acacctt 27 

<210> 88 

<211> 26 

<212> DNA 

<213> Homo sapiens 

<400> 88 

ggacctattg agccctgcat ctgtct 26 

<210> 89 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 89 

ggttcccctg ccgcggctgg ccaca 25 

<210> 90 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 90 

gccctgggct gaaagtggaa ag 22 

<210> 91 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 91 

atgcgggagg ccttgcggga get 23 

<210> 92 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 92 

ctctgcgcgg cggtgctggc taag 24 

<210> 93 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 93 

taccccaggc caatcatgcc caaca 25 

<210> 94 
<211> 27 
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<212> DNA 

<213> Homo sapiens 
<400> 94 

tccaggcaaa atacttttta aaaaatg 27 

<210> 95 
<211> 23 
<212> DNA 

<213> Homo sapiens 
<400> 95 

agcatgcggg aggcctcgcg gga 23 

<210> 96 
<211> 21 
<212> DNA 

<213> Homo sapiens 
<400> 96 

gcgggagctc catgcgagag g 21 

<210> 97 
<211> 25 
<212> DNA 

<213> Homo sapiens 
<400> 97 

tggtggtcct caccccggag gtgaa 25 

<210> 98 
<211> 26 
<212> DNA 

<213> Homo sapiens 
<400> 98 

tacatcaaag aggagaactt tttcac 26 

<210> 99 
<211> 26 
<212> DNA 

<213> Homo sapiens 
<400> 99 

tgatcaggca cctgaatgct acttcc 26 

<210> 100 
<211> 21 
<212> DNA 

<213> Homo sapiens 
<400> 100 

acctctgcgg ggcggtgctg g 21 

<210> 101 
<211> 23 
<212> DNA 

<213> Homo sapiens 
<400> 101 

aagaacatgc tttaccctct ggc 23 

<210> 102 
<211> 18 
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<212> DNA 

<213> Homo sapiens 

<400> 102 

ctctggctct gtcctacc 18 

<210> 103 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 103 

tcctaccttt gctatgctgt ttct 24 

<210> 104 

<211> 17 

<212> DNA 

<213> Homo sapiens 

<400> 104 

tgtcagtggt ggatatt 17 

<210> 105 

<211> 16 

<212> DNA 

<213> Homo sapiens 

<400> 105 

ggtggatatt ctcagc 16 

<210> 106 

<211> 13 

<212> DNA 

<213> Homo sapiens 

<400> 106 

tcagctatgc ate 23 

<210> 107 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 107 

gcatccgtgt ggctgttccg a 21 

<210> 108 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 108 

tggctgttcc gaegggaett 20 

<210> 109 

<211> 16 

<212> DNA 

<213> Homo sapiens 

<400> 109 

gggacttcgt gatgga 16 

<210> 110 
<211> 23 
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<212> DNA 

<213> Homo sapiens 

<400> 110 

gtgatggact accccaggcc gat 23 

<210> 111 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 111 

cctgcctcct tcgcgcattt cagag 25 

<210> 112 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 112 

gcgatcattc ctgactgctc ctcag 25 

<210> 113 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 113 

ccctggagca tgcattcagc ag 22 

<210> 114 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 114 

cattcagcag cagcccagac cct 23 

<210> 115 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 115 

tacttcttcc acgtactata tta 23 

<210> 116 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 116 

ggcctccttc cactatatgt gtgt 24 

<210> 117 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 117 

ggagagagta cggaaccaca t 21 

<210> 118 
<211> 23 
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<212> DNA 

<213> Homo sapiens 

<400> 118 

tcaatttggt tattgcgaac tga 23 

<210> 119 
<211> 22 
<212> DNA 

<213> Homo sapiens 
<400> 119 

caggggaata gcttgccact at 22 

<210> 120 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 120 

tgttgcgaac ggactttgtt ttgg 24 

<210> 121 - 
<211> 21 
<212> DNA 

<213> Homo sapiens 
<400> 121 

ttcaccagca atcggtggtg g 21 

<210> 122 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 122 

ctagaaatag cttctgaaat tctcc 25 

<210> 123 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 123 

cggcatatga tatctacagt caca 24 

<210> 124 
<211> 25 
<212> DNA 

<213> Homo sapiens 
<400> 124 

tcaatttggt tgctgcgaac aggac 25 



-35- 



